light estimateLast updated 2026-06-19

Best transcription API with custom vocabulary

When transcripts are full of product names, medical or legal jargon, or unusual spellings, custom vocabulary lifts accuracy. ElevenLabs Scribe v2 leads the offerings supporting phrase lists or keyterm boosting, with AssemblyAI Universal-3 Pro, Deepgram Nova-3 and Speechmatics Enhanced close behind. OpenAI's base model only allows prompt-biasing, not a real phrase list, and Gemini has no dedicated control. Vocabulary handling stays unbenchmarked, so accuracy sets the order. A light estimate from documentation.

DefaultElevenLabs Scribe v2met custom vocabularyElevenLabs Scribe v2
Provider offerings compared on Price, WER, Langs, Latency and capabilities
OfferingPrice ($/1000 min)WERLangsLatencyCapabilities
ElevenLabs Scribe v2ElevenLabs3.672.2%90150 msdiarizationtimestampsvocab
AssemblyAI Universal-3 ProAssemblyAI3.53.1%99150 msdiarizationtimestampsvocab
Deepgram Nova-3Deepgram4.35.2%50300 msdiarizationtimestampsvocab
Speechmatics EnhancedSpeechmatics6.74%70500 msdiarizationtimestampsvocab
OpenAI gpt-4o-transcribeOpenAI64%
Google Gemini 3 FlashGoogle1.92*2.9%

* token-/credit-priced — the headline understates real per-unit cost, so it is excluded from the cheapest ranking.