Audio API - MythicDot.AI

"Hello, this is a demo of the MythicDot audio API..."

Audio APIs

🎤 Speech-to-Text

Convert audio recordings to accurate text transcriptions with automatic language detection.

50+ Languages Timestamps Diarization

🔊 Text-to-Speech

Generate natural-sounding speech from text with multiple voice options and styles.

6 Voices HD Quality Streaming

🌐 Translation

Translate audio from one language to English text in a single API call.

Many→English High Accuracy

👂 Audio Understanding

Analyze audio content directly with our multimodal models (coming soon).

Coming Soon

Speech-to-Text

Transcribe audio with high accuracy using our Whisper-based model:

                    Python
                

from mythicdot import MythicDot

client = MythicDot()

# Transcribe audio file
with open("audio.mp3", "rb") as audio_file:
    transcription = client.audio.transcriptions.create(
        model="mythic-stt-1",
        file=audio_file,
        response_format="verbose_json",  # Includes timestamps
        timestamp_granularities=["word", "segment"]
    )

print(transcription.text)
# Access word-level timestamps
for word in transcription.words:
    print(f"{word.start:.2f}s: {word.word}")
                

Supported Audio Formats

🎵

MP3

.mp3

🎵

WAV

.wav

🎵

FLAC

.flac

🎵

M4A

.m4a

🎵

OGG

.ogg

🎵

WebM

.webm

🎵

MPEG

.mpeg

🎵

MPGA

.mpga

Text-to-Speech

Generate natural-sounding speech from text:

                    Python
                

from mythicdot import MythicDot

client = MythicDot()

# Generate speech
response = client.audio.speech.create(
    model="mythic-tts-1-hd",
    voice="nova",
    input="Welcome to MythicDot AI. Let's build something amazing together.",
    response_format="mp3"
)

# Save to file
response.stream_to_file("output.mp3")
                

Available Voices

A

Alloy

Neutral and balanced

E

Echo

Deep and authoritative

F

Fable

Warm and expressive

O

Onyx

Deep and resonant

N

Nova

Friendly and conversational

S

Shimmer

Bright and clear

Pricing

Model	Description	Price
mythic-stt-1	Speech-to-text transcription	$0.006 / minute
mythic-tts-1	Standard text-to-speech	$0.015 / 1K chars
mythic-tts-1-hd	High-definition TTS	$0.030 / 1K chars