Speech to Text

Transcribe speech to text with the world’s most accurate ASR model

Achieve industry-leading transcription accuracy in 8 languages with Scribe, featuring character-level timestamps, speaker diarization, and audio-event tagging—all delivered in a structured API response for seamless integration

Upload an audio/video file to generate a transcript