Every transcription passes through three matching tiers. Only word-boundary matches are applied — partial-word replacements are prevented.
O(1) HashMap lookup for single-word corrections and multi-word phrase matching. Fastest tier — handles known misspellings and abbreviation expansions instantly.
k8s→KubernetesEdit distance-based matching (configurable 0-3 distance, default 2) using prefix-based indexing. Catches typos and mishearings while a spell validator gate prevents correcting valid English words.
tenserflow→TensorFlowCatches homophones and similar-sounding words that edit distance alone would miss. Essential for voice dictation where words are heard, not typed.
their/there/they're→Correct form based on contextPrompt words go deeper than post-processing — they bias the transcription engine itself toward specific vocabulary before any text is produced.
For the Whisper backend, prompt words are passed as "previous context" so the model expects these terms. For Parakeet, they're compiled into CTC vocabulary models that boost probability at the acoustic decoder level.
Organize corrections by category (medical, legal, programming, names) for easy management.
See how often each correction is applied. Identify which entries are most valuable.
Import and export your dictionary as JSON. Share word lists across devices or teams.
Premium bundled correction databases with per-pack enable/disable and automatic version updates.
Prompt words bias transcription toward specific vocabulary — proper nouns, technical terms, brand names. For Whisper, they're passed as 'previous context'. For Parakeet, they're compiled into CTC vocabulary models that boost probability at the acoustic decoder level.
No. The spell validator gate ensures that SymSpell fuzzy matches are validated against a spell checker — valid English words are not 'corrected' by fuzzy matching. Only genuinely misspelled or misheard words are fixed.
Yes. In the live preview, corrected words are shown with gradient color highlighting. Click any highlighted word to see the original transcription before correction.
Dictionary entries fix text after transcription (post-processing). Prompt words influence the transcription engine itself, biasing it to recognize specific vocabulary during the acoustic decoding phase.