Every transcription runs through three matching stages. Corrections only apply at word boundaries, so you won't get mangled partial replacements.
HashMap lookup for single words and multi-word phrases. Handles known misspellings and abbreviation expansions in O(1) time.
k8s→KubernetesEdit-distance matching with prefix indexing (configurable 0-3 distance). A spell validator gate keeps valid English words from being "corrected."
tenserflow→TensorFlowCatches homophones that edit distance misses. When you say "their" but mean "there," this tier handles it.
their/there/they're→Correct form based on contextUnlike dictionary corrections, prompt words influence recognition before text is produced. The engine expects these terms and is more likely to hear them correctly.
Whisper treats prompt words as prior context. NVIDIA compiles them into CTC vocabulary models that boost acoustic decoder probability. Same result: better recognition for your specific vocabulary.
Group corrections by domain: medical, legal, programming, names. Makes large dictionaries manageable.
See which corrections fire most often. Helps you spot what matters and prune what doesn't.
JSON import/export for sharing word lists across devices or with teammates.
Pre-built correction databases you can toggle on or off. Auto-updated when new versions ship.
Prompt words bias transcription toward specific vocabulary — proper nouns, technical terms, brand names. For Whisper, they're passed as 'previous context'. For NVIDIA, they're compiled into CTC vocabulary models that boost probability at the acoustic decoder level.
No. The spell validator gate ensures that SymSpell fuzzy matches are validated against a spell checker — valid English words are not 'corrected' by fuzzy matching. Only genuinely misspelled or misheard words are fixed.
Yes. In the live preview, corrected words are shown with gradient color highlighting. Click any highlighted word to see the original transcription before correction.
Dictionary entries fix text after transcription (post-processing). Prompt words influence the transcription engine itself, biasing it to recognize specific vocabulary during the acoustic decoding phase.