The preview model is separate from the main engine. You get instant visual feedback without any hit to final accuracy.
An NVIDIA End-of-Utterance model on the Neural Engine processes 320ms audio windows and produces word-level partials with ~300ms latency.
If the EOU model isn't available, preview falls back to the main engine processing 2-second chunks during recording.
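As a rough sketch of what those two window sizes mean in samples (assuming a 16 kHz mono stream; the actual sample rate isn't stated here):

```python
# Sketch: split a mono PCM stream into fixed-size windows.
# The 16 kHz sample rate is an assumption; only the durations are documented.
SAMPLE_RATE = 16_000                      # assumed 16 kHz mono
EOU_WINDOW = int(0.320 * SAMPLE_RATE)     # 320 ms -> 5120 samples
FALLBACK_CHUNK = int(2.0 * SAMPLE_RATE)   # 2 s    -> 32000 samples

def windows(samples, size):
    """Yield consecutive complete windows of `size` samples (no partial tail)."""
    for start in range(0, len(samples) - size + 1, size):
        yield samples[start:start + size]
```

The fallback path is the same loop with `FALLBACK_CHUNK` instead of `EOU_WINDOW`, which is why it feels chunkier: each preview update waits on roughly six times as much audio.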

Preview runs on the Neural Engine. Main transcription runs on Metal GPU. They don't fight over resources.
Words appear progressively with a subtle typewriter effect. Feels natural, not jarring.
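One way such progressive display can work (a sketch, not necessarily the app's actual implementation) is to diff each new partial against the previous one and append only the newly arrived words:

```python
def new_words(prev_partial: str, partial: str) -> list[str]:
    """Return the words in `partial` beyond its shared word-prefix
    with `prev_partial`. Words the engine revises mid-stream come back
    as part of the new tail, so the UI can redraw from that point."""
    prev, cur = prev_partial.split(), partial.split()
    i = 0
    while i < min(len(prev), len(cur)) and prev[i] == cur[i]:
        i += 1
    return cur[i:]
```

Feeding successive partials through this and animating only the returned tail is what produces the typewriter feel: stable words never flicker, and only fresh material is drawn in.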
Dictionary corrections show up with colored highlights. Click one to see what the original transcription was.
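A minimal shape for the data such a highlight needs (hypothetical names; the app's internal types aren't documented here) is each correction's original text kept alongside the replacement and its position:

```python
from dataclasses import dataclass

@dataclass
class Correction:
    start: int        # character offset in the displayed text
    original: str     # what the engine actually transcribed
    replacement: str  # dictionary-corrected form shown on screen

def apply_corrections(text: str, corrections: list[Correction]) -> str:
    """Apply corrections right-to-left so earlier offsets stay valid."""
    for c in sorted(corrections, key=lambda c: c.start, reverse=True):
        text = text[:c.start] + c.replacement + text[c.start + len(c.original):]
    return text
```

Because each `Correction` retains `original`, a click on a highlight can show the pre-correction transcription without re-running the engine.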
About 300ms. The EOU engine processes 320ms audio windows directly on the Neural Engine, with no network round-trip, so it typically beats cloud transcription services.
No. It's a separate lightweight model. The main engine runs independently and produces the final result when you release the button.
Yes. Toggle it off in Settings. The main transcription engine still works; you just won't see words appearing in real-time.
The primary EOU preview needs the Neural Engine (Apple Silicon). If unavailable, it falls back to the main engine with 2-second chunks.
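The selection logic described above can be sketched as a single hypothetical helper (names and return shape are illustrative, not the app's API):

```python
def pick_preview_source(neural_engine_available: bool) -> tuple[str, float]:
    """Choose the preview path: the EOU model on the Neural Engine
    with 320 ms windows when available, otherwise the main engine
    fed 2-second chunks during recording."""
    if neural_engine_available:
        return ("eou", 0.320)
    return ("main", 2.0)
```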