Audio API

Transform audio to text and text to speech. Build voice-enabled applications with production-ready APIs.

Audio APIs

🎤 Speech-to-Text

Convert audio recordings to accurate text transcriptions with automatic language detection.

50+ Languages · Timestamps · Diarization

🔊 Text-to-Speech

Generate natural-sounding speech from text with multiple voice options and styles.

6 Voices · HD Quality · Streaming

🌐 Translation

Translate audio in a supported language into English text in a single API call.

Many→English · High Accuracy

👂 Audio Understanding

Analyze audio content directly with our multimodal models (coming soon).

Coming Soon

Speech-to-Text

Transcribe audio with high accuracy using our Whisper-based model:

Python
from mythicdot import MythicDot

client = MythicDot()

# Transcribe audio file
with open("audio.mp3", "rb") as audio_file:
    transcription = client.audio.transcriptions.create(
        model="mythic-stt-1",
        file=audio_file,
        response_format="verbose_json",  # Includes timestamps
        timestamp_granularities=["word", "segment"],
    )

print(transcription.text)

# Access word-level timestamps
for word in transcription.words:
    print(f"{word.start:.2f}s: {word.word}")
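Word-level timestamps are often turned into subtitles. The following is a minimal sketch of that step, not part of the MythicDot SDK. It assumes each word entry exposes `word`, `start`, and `end`; only `word` and `start` appear in the example above, so `end` is an assumption. `Word`, `srt_time`, and `words_to_srt` are hypothetical names used for illustration.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """Stand-in for one word-level timestamp entry (hypothetical shape)."""
    word: str
    start: float  # seconds
    end: float    # seconds; assumed field, not shown in the example above


def srt_time(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words, max_words: int = 7) -> str:
    """Group words into fixed-size captions and render them as SRT text."""
    blocks = []
    for index, i in enumerate(range(0, len(words), max_words), start=1):
        group = words[i:i + max_words]
        text = " ".join(w.word.strip() for w in group)
        # Each caption spans from its first word's start to its last word's end.
        blocks.append(
            f"{index}\n{srt_time(group[0].start)} --> {srt_time(group[-1].end)}\n{text}\n"
        )
    return "\n".join(blocks)
```

If the response objects do carry these attributes, `words_to_srt(transcription.words)` could be written straight to a `.srt` file. Grouping by a fixed word count is the simplest choice; splitting on pauses or punctuation would give more natural captions.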

Supported Audio Formats

MP3 (.mp3)
WAV (.wav)
FLAC (.flac)
M4A (.m4a)
OGG (.ogg)
WebM (.webm)
MPEG (.mpeg)
MPGA (.mpga)
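A quick client-side check can reject unsupported files before they are uploaded. The sketch below uses only the extensions listed above; `SUPPORTED_EXTENSIONS` and `is_supported_audio` are hypothetical names, and the check looks at the file name only, not the actual audio encoding.

```python
from pathlib import Path

# Extensions taken from the supported-format list above.
SUPPORTED_EXTENSIONS = frozenset(
    {".mp3", ".wav", ".flac", ".m4a", ".ogg", ".webm", ".mpeg", ".mpga"}
)


def is_supported_audio(path: str) -> bool:
    """Return True if the file extension is in the supported list (case-insensitive)."""
    return Path(path).suffix.lower() in SUPPORTED_EXTENSIONS
```

For example, `is_supported_audio("interview.MP3")` passes while `is_supported_audio("notes.txt")` does not. The service may still reject a file whose contents do not match its extension.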

Text-to-Speech

Generate natural-sounding speech from text:

Python
from mythicdot import MythicDot

client = MythicDot()

# Generate speech
response = client.audio.speech.create(
    model="mythic-tts-1-hd",
    voice="nova",
    input="Welcome to MythicDot AI. Let's build something amazing together.",
    response_format="mp3",
)

# Save to file
response.stream_to_file("output.mp3")

Available Voices

Alloy: Neutral and balanced
Echo: Deep and authoritative
Fable: Warm and expressive
Onyx: Deep and resonant
Nova: Friendly and conversational
Shimmer: Bright and clear

Pricing

Model            Description                   Price
mythic-stt-1     Speech-to-text transcription  $0.006 / minute
mythic-tts-1     Standard text-to-speech       $0.015 / 1K chars
mythic-tts-1-hd  High-definition TTS           $0.030 / 1K chars
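To budget a workload, the rates above can be turned into a small estimator. This is a rough sketch with two assumptions the pricing table does not confirm: audio duration is billed pro rata rather than rounded up to whole minutes, and the billed character count equals `len(text)`. The function names are hypothetical.

```python
from decimal import Decimal

# Rates copied from the pricing table above (USD).
STT_PER_MINUTE = {"mythic-stt-1": Decimal("0.006")}
TTS_PER_1K_CHARS = {
    "mythic-tts-1": Decimal("0.015"),
    "mythic-tts-1-hd": Decimal("0.030"),
}


def estimate_stt_cost(duration_seconds: float, model: str = "mythic-stt-1") -> Decimal:
    """Estimate transcription cost, assuming pro-rated (not rounded-up) minutes."""
    minutes = Decimal(str(duration_seconds)) / Decimal(60)
    return minutes * STT_PER_MINUTE[model]


def estimate_tts_cost(text: str, model: str = "mythic-tts-1") -> Decimal:
    """Estimate speech cost, assuming len(text) matches the billed character count."""
    return Decimal(len(text)) / Decimal(1000) * TTS_PER_1K_CHARS[model]
```

Under these assumptions, a 10-minute recording costs 10 × $0.006 = $0.06 to transcribe, and 2,500 characters of HD speech cost 2.5 × $0.030 = $0.075. `Decimal` is used so that small per-unit rates do not pick up floating-point rounding noise.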