WaveNet (reference)
DeepMind's seminal 2016 neural-vocoder paper — historical reference only.
DeepMind's seminal 2016 neural-vocoder paper — historical reference only.
Best for academic teaching and historical baseline for vocoder research. Pricing: free (community reproductions, varied licenses).
What it is
WaveNet (DeepMind, 2016) is the paper that launched modern neural-vocoder work. The original code was never open-sourced; r9y9's MIT-licensed PyTorch reproduction is the most-used public re-implementation. Listed for historical completeness. Consent posture: vocoder architecture only — no end-user surface.
Watch out for: Original implementation never open-sourced by DeepMind; community reproductions vary in quality and licensing.
Install / use
git clone https://github.com/r9y9/wavenet_vocoder
Features
| Speaker diarization | No |
| Word-level timestamps | No |
| Streaming / real-time | No |
| Languages supported | 1 |
| HIPAA eligible | No |
WaveNet (reference) vs Whipscribe
| Feature | WaveNet (reference) | Whipscribe |
|---|---|---|
| Category | Open source | Transcription APIs |
| Pricing | free (community reproductions, varied licenses) | free beta |
| Speaker diarization | — | Yes |
| Word timestamps | — | Yes |
| Streaming | — | No |
| Languages | 1 | 99 |
| Platforms | Linux | Web, API, MCP |
Alternatives to WaveNet (reference)
Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.