Watch words appear as you talk

Text shows up while you're still speaking. A dedicated streaming engine on the Neural Engine handles preview with ~300ms latency. When you release the record button, the main engine delivers the final polished result.

Two-Tier Streaming Architecture

The preview model is separate from the main engine. You get instant visual feedback without any hit to final accuracy.

Primary: EOU Engine

An NVIDIA End-of-Utterance model on the Neural Engine processes 320ms audio windows and produces word-level partials with ~300ms latency.

Neural Engine | 320ms windows | ~300ms latency
Fallback: StreamingTranscriber

If the EOU model isn't available, preview falls back to the main engine processing 2-second chunks during recording.

Metal GPU | 2s chunks | Used as fallback

Live Preview Features

~300ms Latency

The EOU engine chews through 320ms audio windows on the Neural Engine and spits out word-level partials almost immediately.

Dual-Engine Architecture

Preview runs on the Neural Engine. Main transcription runs on Metal GPU. They don't fight over resources.

Typewriter Animation

Words appear progressively with a subtle typewriter effect. Feels natural, not jarring.

Keyword Highlighting

Dictionary corrections show up with colored highlights. Click one to see what the original transcription was.

Technical Details

Audio buffering
5120 samples (320ms)
Preview latency
~300ms
EOU hardware
Apple Neural Engine
Main engine
Metal GPU (separate)
Waveform bars
20-bar real-time display
Resource usage
Toggleable to save power

Frequently Asked Questions

How fast is the live preview?

About 300ms latency. The EOU engine processes 320ms audio windows on the Neural Engine, faster than any cloud service.

Does live preview affect final accuracy?

No. It's a separate lightweight model. The main engine runs independently and produces the final result when you release the button.

Can I disable live preview to save battery?

Yes. Toggle it off in Settings. The main transcription engine still works; you just won't see words appearing in real-time.

Does live preview work with all engines?

The primary EOU preview needs the Neural Engine (Apple Silicon). If unavailable, it falls back to the main engine with 2-second chunks.

Words appear instantly

See what you're saying while you're still saying it.

Try it.

Pay once. Keep it forever. Nothing goes to the cloud.

Free trial included. Pro Pack $14.99 lifetime.