Kokoro TTS
by hexgrad (research)
Lightweight 82M-param open-source TTS — Apache-2.0, runs on a Raspberry Pi.
TL;DR
Best for edge / on-device TTS where you need decent quality at 82M parameters. Pricing: free (Apache-2.0).
Category
Open source
License
Apache-2.0
Pricing
free (Apache-2.0)
Platforms
Linux, macOS, Windows
What it is
Kokoro is an 82M-parameter TTS model released under the permissive Apache-2.0 license. For its size, it scores well on MOS (mean opinion score) leaderboards. Consent posture: synthetic-only.
Best for: Edge / on-device TTS where you need decent quality at 82M parameters.
Watch out for: Synthetic voices only — no cloning surface in the public model.
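To make the edge / on-device claim concrete, here is a rough back-of-the-envelope sketch of how much storage 82M parameters need at a few common numeric widths. It counts weights only and ignores activations, voice data, and runtime overhead. The precisions listed are illustrative assumptions, not a statement of what the published checkpoint ships with.

```python
def weight_megabytes(n_params: int, bytes_per_param: int) -> float:
    """Approximate size of the weights alone, in decimal megabytes."""
    return n_params * bytes_per_param / 1_000_000


N_PARAMS = 82_000_000  # parameter count quoted in this listing

# Assumed storage widths, for illustration only.
for label, width in [("float32", 4), ("float16", 2), ("int8", 1)]:
    print(f"{label}: ~{weight_megabytes(N_PARAMS, width):.0f} MB")
```

Even at full 32-bit precision the weights come to roughly 328 MB, which is why a model of this size is plausible on a small single-board computer.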
Install / use
pip install kokoro
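Once the package is installed, generating speech might look roughly like the sketch below. It follows the usage pattern in the project's README as best recalled: `KPipeline`, the `lang_code="a"` setting, the `af_heart` voice, and the 24 kHz sample rate are taken from there and may differ between versions. The `soundfile` package is an extra dependency assumed here for writing WAV files, and `segment_path` and `synthesize` are hypothetical helper names introduced for illustration.

```python
from pathlib import Path


def segment_path(out_dir: str, index: int) -> Path:
    """Build the output filename for one audio segment (hypothetical helper)."""
    return Path(out_dir) / f"kokoro_{index:03d}.wav"


def synthesize(text: str, out_dir: str = "out", voice: str = "af_heart") -> list[Path]:
    """Write one WAV file per segment yielded by the pipeline."""
    # Imported lazily so the helper above works without the model installed.
    # Assumption: these names match the installed kokoro version.
    from kokoro import KPipeline
    import soundfile as sf

    Path(out_dir).mkdir(parents=True, exist_ok=True)
    pipeline = KPipeline(lang_code="a")  # "a" is American English per the README
    written = []
    # Each item is assumed to be (graphemes, phonemes, audio).
    for i, (_graphemes, _phonemes, audio) in enumerate(pipeline(text, voice=voice)):
        path = segment_path(out_dir, i)
        sf.write(str(path), audio, 24000)  # assumed 24 kHz output rate
        written.append(path)
    return written
```

Calling `synthesize("Kokoro is a small text-to-speech model.")` should leave numbered WAV files under `out/`. Long inputs are split into several segments, so expect more than one file.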
Features
| Feature | Kokoro TTS |
|---|---|
| Speaker diarization | No |
| Word-level timestamps | No |
| Streaming / real-time | No |
| Languages supported | 8 |
| HIPAA eligible | No |
Kokoro TTS vs Whipscribe
| Feature | Kokoro TTS | Whipscribe |
|---|---|---|
| Category | Open source | Transcription APIs |
| Pricing | free (Apache-2.0) | free beta |
| Speaker diarization | No | Yes |
| Word timestamps | No | Yes |
| Streaming | No | No |
| Languages | 8 | 99 |
| Platforms | Linux, macOS, Windows | Web, API, MCP |
Alternatives to Kokoro TTS
Whipscribe is a managed faster-whisper + whisperX transcription service. It converts speech to text rather than text to speech, so it complements Kokoro instead of replacing it. It suits cases where you want transcripts from a URL or an uploaded file without running infrastructure.