Universal-Streaming - Ultra-fast, ultra-accurate streaming STT for voice agents.
Universal-Streaming delivers all the streaming speech-to-text voice agents need in one robust API: ultra-fast immutable transcripts, higher accuracy, built-in endpointing, and transparent pricing at $0.15/hour with unlimited concurrency.
AssemblyAI - Speech-to-Text API - APIs to automatically transcribe and understand audio
Universal 2 - Speech-to-text for conversational data
Introducing Universal-2: The latest advancement in Speech-to-Text technology. Capture the complexity of human speech, enhanced transcript quality, and better conversational insights by tapping into the next generation of Speech AI.
Universal-1 - Multilingual speech AI model trained on 12.5M hours of data
Try AssemblyAI's most capable and highly trained speech recognition model trained on 12.5M hours of multilingual audio data. Universal-1 achieves best-in-class speech-to-text accuracy, reduces word error rate and hallucinations, and improves timestamps.
AssemblyAI Auto Chapters - Automatically summarize audio and video files with an API
With Auto Chapters by AssemblyAI, you can generate an automatic "summary over time" for your audio and video files as the topic of conversation changes.