Drop your audio. Transcript in seconds. 30 free min, then $2 = 200 min
SuperWhisper
System-wide voice dictation for macOS, Windows, and iOS — hold a hotkey, speak, and AI-formatted text drops into the focused field. Modes rewrite the transcript for email, code, notes, or chat.
Superwhisper is a native dictation app that wires Whisper (local) and frontier LLMs (cloud, BYOK) to a global hotkey. Hold the key in any app — IDE, mail client, Slack, browser — speak, and the transcript types itself into the focused text field. The differentiator is Modes: each mode is a saved profile of voice model + AI rewrite prompt + auto-activation rule, so the same dictated phrase becomes a tight Slack reply in Slack, a formal paragraph in Gmail, or commented Python in your editor. Built-in modes ship with the app — Voice to Text, Message, Email, Note, Super, Meeting — and custom modes let you write the system prompt yourself.
Best for people who type for a living and want voice as the default input across the OS — engineers dictating commit messages, founders firing off email replies, journalists capturing thoughts into Obsidian. Three tiers: a forever-free plan (small local models, unlimited use), a Pro subscription (cloud + local AI models, BYOK, file transcription, translate-to-English), and an Enterprise plan (SOC 2, centralized billing, model access control). The Free tier also includes a 15-minute trial of Pro features. Verify current pricing on the homepage pricing toggle — monthly / yearly (2 months free) / lifetime are offered; 40% student discount on Pro.
What it is
SuperWhisper turns Whisper into a system-wide dictation shortcut. Hold a key anywhere on macOS or iOS, speak, and the transcript types itself into whatever text field you're in. Great for email, IDE comments, and chat — not the right tool for transcribing podcasts.
Watch out for: Not a file-transcription tool; optimized for short dictation, not long-form media.
Install / use
Download SuperWhisper superwhisper.comBuilt-in modes · 6 dictation profiles
Each mode bundles a voice model, an AI rewrite prompt, and an auto-activation rule (per-app or per-website). Switch with a hotkey or let the mode trigger itself when you focus a specific app. The list below maps the shipped defaults to the surface they were tuned for — start here, then duplicate one to build a custom mode.
Rewrites loose speech into a structured email body — paragraph breaks, polite phrasing, kept-on-topic. Auto-activates in Mail, Outlook, Gmail in the browser, or any client you whitelist. Tune the underlying prompt to match your voice (terse vs warm, first-person vs team-voice).
Ships built-in. Pair with a cloud LLM on Pro for tighter rewrites; small local model works on Free.
Casual short-form output — keeps contractions, drops filler words, no email-style greeting. Designed for chat surfaces where verbosity feels off. Auto-activate it on Slack and your DM apps and you can dictate replies as fast as you'd type them, without the autocorrect tax.
Light rewrite. Works well with the local small model on Free; cloud models on Pro polish faster.
Keeps your raw stream-of-thought largely intact but cleans up false starts and disfluencies. Useful when you want the structure you spoke, not the LLM's interpretation of it. Auto-activate on your notes app of choice for a friction-free capture loop.
Built-in. Disable the rewrite step entirely if you want pure transcription with no model interference.
The shipped Super mode handles general-purpose dictation; the recipe for code work is to duplicate it and write a custom system prompt that emits markdown, fenced code blocks, or commented prose. Pair with the Pi coding-agent integration added in v2.13.2 for agent-style follow-on actions.
BYOK on Pro: GPT-5, Claude Haiku 4.5, Llama 4, Grok 4.1, Gemini 3.0 Flash, Ministral, Opus 4.7. Local fallback when offline.
The exception to the dictation-only positioning — Meeting mode captures and transcribes long-form audio (your own meetings) and produces a structured transcript. Distinct from the file-transcription path (which accepts MP3/MP4/16 kHz mono WAV from Finder, menu bar, or CLI).
File transcription is gated to Pro; live Meeting capture works on Free with usage limits.
Duplicate any built-in mode and override the AI rewrite prompt, the voice model, the language, and the auto-activation rule (app bundle ID or website URL pattern). This is where Superwhisper diverges from generic dictation — your mode library becomes your personal voice-to-text DSL.
Local Whisper Large for offline; cloud models on Pro for higher rewrite quality.
Getting started in 3 steps
Pick your platform — macOS 13.3+, Windows 10+, or iOS 18+ via the App Store. The Mac and Windows builds need Accessibility permission so the global hotkey can intercept keystrokes and type the transcript into the focused field.
1. Open https://superwhisper.com/download in your browser.
2. Pick the platform card that matches your OS:
- macOS: requires 13.3 or later (Apple Silicon strongly preferred for local models).
- Windows: 10 or later, x64 or ARM64 build.
- iOS: requires iOS 18 or later, installed via the App Store.
3. Install via the standard installer flow for your OS.
4. On first launch, the app will prompt for:
- Accessibility access (so the hotkey can type into other apps).
- Microphone access (obvious).
5. Grant both, then set your global hotkey in Settings.
Skip the empty-Notes-window demo. Open the app where you'll actually use voice — your IDE, your mail client, your chat app — and trigger the matching built-in mode. The point of Modes is that the output already fits the surface.
1. Open Mail (or Gmail in the browser) and click into a draft reply.
2. Hold the global hotkey (default varies by OS — set in Settings).
3. Speak: "Sounds good — let's pick this up Thursday afternoon."
4. Release the hotkey.
5. The Email mode rewrites and types the result into the draft.
6. Repeat in Slack with Message mode, in your IDE with a code mode, etc.
7. Set per-app auto-activation under Mode settings so the right
mode picks itself when you focus that app.
The single highest-leverage configuration step — duplicate the closest built-in mode and rewrite the system prompt to match your actual writing voice and the formatting your target app expects (markdown, fenced code, bullet lists, specific tone).
1. Settings → Modes → duplicate the closest built-in mode
(e.g. duplicate Message mode for a custom Slack mode).
2. Edit the AI Prompt field — be specific:
- Tone you want ("terse engineer voice", "warm but professional").
- Formatting rules ("emit markdown", "wrap code in fences", "use bullet lists for steps").
- Constraints ("keep under 3 sentences", "do not add greetings").
3. Pick the voice model — local Whisper for offline / privacy,
or a cloud LLM via BYOK on Pro for higher rewrite quality.
4. Set the auto-activation rule — match by app bundle ID or by
website URL pattern so the mode picks itself.
5. Trigger the hotkey while focused on the target app.
Features
| Speaker diarization | No |
| Word-level timestamps | No |
| Streaming / real-time | Yes |
| Languages supported | 99 |
| HIPAA eligible | No |
Links
- superwhisper.com · homepage ↗ ↗product positioning, pricing toggle (monthly / yearly / lifetime), feature matrix
- superwhisper.com · download ↗ ↗downloads landing for macOS 13.3+, Windows 10+, and iOS 18+ (App Store)
- docs · get-started · introduction ↗ ↗onboarding: system permissions, language selection, microphone setup
- docs · modes ↗ ↗modes concept, built-in mode list (Voice to Text, Message, Email, Note, Super, Meeting, Custom), auto-activation rules
- docs · keyboard shortcuts ↗ ↗configure the global hotkey and the per-mode trigger shortcuts
- docs · file transcription ↗ ↗transcribe MP3 / MP4 / 16 kHz mono WAV from menu bar, Finder, or CLI (Pro-tier feature)
- superwhisper.com · changelog ↗ ↗reverse-chronological release notes — v2.13.2 (2026-04-29) added BYOK Opus 4.6/4.7 and Pi coding-agent support
- x.com · @superwhisperapp ↗ ↗official Twitter / X account — release announcements and demos
- Discord · Superwhisper community ↗ ↗vendor-run community server — fastest path to support and mode-sharing
SuperWhisper vs Whipscribe
| Feature | SuperWhisper | Whipscribe |
|---|---|---|
| Category | Desktop apps | Transcription APIs |
| Pricing | freemium | free beta |
| Speaker diarization | No | Yes |
| Word timestamps | No | Yes |
| Streaming | Yes | No |
| Languages | 99 | 99 |
| Platforms | macOS, Windows, iOS | Web, API, MCP |
Alternatives to SuperWhisper
Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.