Convert speech to text with industry-leading accuracy, and generate lifelike speech from text. Build voice-enabled applications with ease.
Transcribe audio files or real-time audio streams with state-of-the-art accuracy. Supports 50+ languages with automatic detection.
Generate natural, lifelike speech from text. Choose from 6 voices with different styles and personalities.
Six unique voices designed for different use cases.
Warm, professional
Friendly, conversational
Deep, authoritative
Expressive, storytelling
Clear, technical
Bright, energetic
from mythicdot import MythicDot client = MythicDot() # Transcribe an audio file with open("meeting.mp3", "rb") as audio_file: transcript = client.audio.transcriptions.create( model="mythic-whisper-1", file=audio_file, response_format="verbose_json", timestamp_granularities=["word"] ) print(transcript.text) # Access word-level timestamps for word in transcript.words: print(f"{word.word} [{word.start}s - {word.end}s]") # Real-time streaming transcription async for chunk in client.audio.transcriptions.stream( model="mythic-whisper-1", audio_stream=microphone_stream ): print(chunk.text, end="", flush=True)
Start building voice-enabled experiences today. Free tier includes 60 minutes of transcription per month.