March 15, 20266 min read

    Whisper vs Parakeet vs Apple Speech: Which Transcription Engine to Use

    Whisperer is the only Mac dictation app that supports three transcription engines — Whisper, Parakeet, and Apple Speech. Each uses different hardware, different models, and has different strengths. This guide helps you choose the right engine for your workflow.

    Engine Overview#

    WhisperParakeetApple Speech
    Librarywhisper.cpp (vendored)FluidAudio (CoreML)SpeechAnalyzer
    HardwareMetal GPUApple Neural EngineSystem ML
    Languages99+25 (v3), English (v2)System languages
    Models10+ (75MB–2.9GB)2 versionsBuilt-in
    AccuracyHighestVery highGood
    SpeedFast (GPU)Fastest (ANE)Fast
    Min macOS14+14+ (Apple Silicon)26+ (Tahoe)

    Whisper — The Default Engine#

    Whisper is OpenAI's open-source speech recognition model, running locally via whisper.cpp. Whisperer vendors the C library directly — no Python, no dependencies.

    How It Works#

    Whisper processes your complete audio recording after you release the record button. It runs on Apple Silicon's Metal GPU, which provides fast inference without blocking the CPU or Neural Engine for other tasks.

    Model Sizes#

    ModelSizeSpeedAccuracyBest For
    Tiny75MBFastestBasicQuick testing
    Base142MBVery fastGoodLow-storage devices
    Small466MBFastHighDaily dictation
    Medium1.5GBModerateVery highAccuracy-focused
    Large V32.9GBSlowerHighestMaximum accuracy
    Large V3 Turbo~1.5GBFastVery highBest balanced
    Tip

    Recommended starting point: Large V3 Turbo (~1.5GB). It offers near-Large accuracy at significantly faster speeds. For most users, this is the "set and forget" model.

    When to Use Whisper#

    • You need the highest accuracy available
    • You're dictating in non-English languages (99+ supported)
    • You want the widest model selection to trade speed for accuracy
    • You're doing file transcription (Whisper handles long audio well)

    Parakeet — The Fastest Engine#

    Parakeet is NVIDIA's CTC-based speech recognition model, running on Apple Silicon's Neural Engine via CoreML. It's the fastest option available — and it has a unique feature: CTC vocabulary boosting.

    How It Works#

    Parakeet uses the dedicated Neural Engine (ANE), which is separate from the GPU. This means Parakeet can run simultaneously with Whisper without resource contention. It's also used for Whisperer's live preview — the EOU (End-of-Utterance) engine that provides ~300ms preview latency.

    CTC Vocabulary Boosting#

    This is Parakeet's killer feature. Your personal dictionary entries and prompt words directly bias the CTC decoder at the acoustic level. This means:

    • Project-specific terms are recognized more accurately
    • Names, acronyms, and jargon are decoded correctly
    • The boost happens during decoding, not as a post-processing step
    Info

    Vocabulary boosting is unique to Parakeet's CTC architecture. Whisper (attention-based) and Apple Speech don't support this kind of direct decoder biasing. If accurate recognition of custom terms is critical, Parakeet with a well-configured dictionary is the best choice.

    When to Use Parakeet#

    • You want the fastest possible transcription
    • You have a large personal dictionary (vocabulary boosting)
    • You want to leave the GPU free for other tasks (video editing, ML training)
    • You primarily dictate in English (v2) or one of the 25 supported languages (v3)

    Apple Speech — The Native Option#

    Apple Speech uses macOS's built-in SpeechAnalyzer framework, available on macOS 26 (Tahoe) and later. Zero setup — no model downloads required.

    How It Works#

    Apple Speech leverages the system's built-in speech recognition pipeline. It's deeply integrated with macOS and uses Apple's own ML infrastructure. No additional models to download or manage.

    When to Use Apple Speech#

    • You're on macOS 26+ (Tahoe)
    • You want zero setup — no model downloads
    • You need quick, lightweight dictation without maximum accuracy
    • You're low on disk space and can't store Whisper models

    Head-to-Head Comparison#

    Accuracy#

    For English dictation:

    EngineShort PhrasesLong DictationTechnical TermsCode Terms
    Whisper (Large V3 Turbo)ExcellentExcellentVery goodGood
    Parakeet (v3)ExcellentVery goodExcellent*Good
    Apple SpeechGoodGoodFairPoor

    *With vocabulary boosting enabled and dictionary configured.

    Speed (Apple M1 Pro, typical 10-second recording)#

    EngineModelProcessing TimeReal-Time Factor
    Parakeet v3Default~0.8s0.08x
    WhisperLarge V3 Turbo~2.5s0.25x
    WhisperSmall~1.2s0.12x
    WhisperLarge V3~5.0s0.50x
    Apple SpeechBuilt-in~1.5s0.15x
    Tip

    Parakeet is 2–3x faster than Whisper Large V3 Turbo for typical dictation. If speed is your priority and you don't need 99+ languages, Parakeet is the clear winner.

    Resource Usage#

    EngineHardware UsedGPU Free?ANE Free?Battery Impact
    WhisperMetal GPUNoYesModerate
    ParakeetNeural EngineYesNoLow
    Apple SpeechSystem MLVariesVariesLow

    Parakeet's Neural Engine usage means your GPU stays free for video editing, 3D rendering, or ML workloads. This makes Parakeet ideal for multitasking.

    Choosing Your Engine#

    Best Overall

    Whisper Large V3 Turbo — the best balance of accuracy and speed. Works for 99+ languages. Start here.

    Fastest

    Parakeet — 2–3x faster, leaves GPU free. Best for English-primary users with large dictionaries.

    Zero Setup

    Apple Speech — no downloads, instant start. Good enough for casual dictation on macOS 26+.

    Maximum Accuracy

    Whisper Large V3 (2.9GB) — the most accurate model available. Use for file transcription or accuracy-critical work.

    Hot-Swapping Engines#

    Whisperer lets you switch between engines without restarting. Open Settings → Engine and select your preferred backend. The switch is instant — no app restart, no re-configuration.

    You can even use different engines for different purposes:

    • Parakeet for live dictation (fastest feedback)
    • Whisper Large V3 for file transcription (maximum accuracy)
    • Apple Speech when you're low on battery

    Related: Offline Transcription Engines, Best Whisper Model for Dictation, Whisper vs Parakeet Transcription. See all features and pricing.

    Ready to try voice dictation on your Mac?

    Free download. No account required. 100% offline.

    Download on the Mac App Store

    Related articles

    Ready to ditch typing?

    Join developers and power users who dictate faster than they type. One-time purchase. No subscription. No cloud.

    Free trial included. Pro Pack $14.99 lifetime.