Whisperer lets you choose between multiple Whisper models for transcription. But which one should you use? The answer depends on your hardware, language, and priorities.
Understanding Whisper Models#
OpenAI's Whisper is an open-source speech recognition model. whisper.cpp is a C++ implementation that runs efficiently on Apple Silicon. Whisperer uses whisper.cpp under the hood, giving you access to several model sizes.
Model Comparison#
| Model | Size | RAM Usage | Speed* | Accuracy | Best For |
|---|---|---|---|---|---|
| Tiny | ~75MB | ~200MB | Instant | Fair | Quick notes, low-end hardware |
| Base | ~150MB | ~350MB | Very fast | Good | Fast dictation, older Macs |
| Small | ~500MB | ~800MB | Fast | Very good | Balanced everyday use |
| Medium | ~1.5GB | ~2.5GB | Moderate | Excellent | High-accuracy dictation |
| Large V3 | ~3GB | ~5GB | Slower | Best | Maximum accuracy, long dictation |
| Turbo | ~800MB | ~1.5GB | Fast | Very good | Best speed/accuracy balance |
*Speed benchmarks on M1 MacBook Pro for a 30-second recording.
Our Recommendations#
For Most Users: Turbo (Recommended)
The Turbo model offers the best balance of speed and accuracy. It's significantly faster than Large V3 while maintaining near-equivalent accuracy for English. If you're on Apple Silicon, start here.
For Maximum Accuracy: Large V3
If accuracy is your top priority and you have a Mac with 16GB+ RAM, Large V3 gives the best results. It's particularly better for:
- Non-English languages
- Heavy accents
- Technical terminology
- Noisy environments
For Speed: Small or Base
If you want near-instant transcription and can tolerate occasional errors, the Small model is excellent. The Base model is even faster but accuracy drops noticeably for complex speech.
For Older Macs: Base or Tiny
Intel Macs and older Apple Silicon with 8GB RAM should use Base or Small. Larger models may cause memory pressure and slow down your system.
Language Considerations#
Whisper's accuracy varies by language. For English, even the Small model is quite good. For other languages:
- Western European languages (Spanish, French, German): Small or Turbo works well
- East Asian languages (Chinese, Japanese, Korean): Use Medium or Large V3
- Less-common languages: Large V3 recommended for best results
Model Preloading#
Whisperer can preload your selected model into memory so it's ready instantly when you press the record shortcut. This eliminates the 1–3 second cold start on first use. Enable this in Settings if you have enough RAM.
Hot-Swapping Models#
You can switch between models without restarting Whisperer. This is useful when you want:
- Turbo for quick notes during the day
- Large V3 for important dictation that needs to be accurate
Performance Tips#
Close memory-heavy apps
If using Large V3 on a 16GB Mac, free up RAM by closing unused applications.
Use Turbo for streaming
Use the balanced/Turbo model for live streaming preview — it's fast enough for real-time feedback.
Enable model preloading
Eliminate startup delay by keeping your selected model in memory.
Choose explicit language
Select your language explicitly over auto-detect for better accuracy.
Use a good microphone
Model accuracy matters less than input quality — invest in a decent mic.
The Bottom Line#
Start with the Turbo model. If accuracy isn't good enough, try Large V3. If speed is more important, drop to Small. Whisperer makes it easy to experiment — download multiple models and switch between them instantly.
Related: Whisper vs Parakeet vs Apple Speech, Offline Transcription Engines, Getting Started Guide. See pricing and all features.
Ready to try voice dictation on your Mac?
Free download. No account required. 100% offline.
Download on the Mac App Store