TIMIT
by LDC / NIST / Texas Instruments + MIT
Phonetically-balanced 5h read-speech corpus from 1986 — phoneme recognition benchmark.
TL;DR
Best for phoneme-level recognition baselines and teaching ASR fundamentals. Pricing: paid.
Category
Open source
License
LDC license
Pricing
paid
Platforms
LDC
What it is
TIMIT (LDC93S1) is the foundational phonetically balanced read-speech corpus: 630 speakers across 8 US dialect regions, recorded at 16 kHz with time-aligned phonetic transcriptions. It remains the standard corpus for teaching phoneme recognition. It is distributed by the LDC under a paid license.
Best for: Phoneme-level recognition baselines; teaching ASR fundamentals.
Watch out for: LDC license (paid).
Corpus facts: 630 speakers from 8 dialect regions · 6300 utterances · phoneme-aligned. Cite: Garofolo et al., 1993.
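Phoneme recognition baselines are usually reported as phone error rate (PER): the edit distance between the reference and predicted phone sequences, divided by the reference length. The sketch below is a minimal, toolkit-independent illustration. The function names and the example phone sequences are hypothetical, and it does not apply any phone-set folding that a particular benchmark setup might use.

```python
from typing import Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two label sequences (unit costs)."""
    prev = list(range(len(hyp) + 1))  # distance from empty ref prefix
    for i, r in enumerate(ref, 1):
        curr = [i]  # distance from ref[:i] to empty hyp prefix
        for j, h in enumerate(hyp, 1):
            substitution = prev[j - 1] + (0 if r == h else 1)
            deletion = prev[j] + 1      # drop a reference phone
            insertion = curr[j - 1] + 1  # extra hypothesis phone
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1]


def phone_error_rate(ref: Sequence[str], hyp: Sequence[str]) -> float:
    """PER = (substitutions + deletions + insertions) / len(ref)."""
    if not ref:
        raise ValueError("reference sequence must not be empty")
    return edit_distance(ref, hyp) / len(ref)


# Illustrative sequences only, not taken from the corpus.
reference = ["sh", "iy", "hh", "ae", "d"]
hypothesis = ["sh", "ix", "hh", "d"]
per = phone_error_rate(reference, hypothesis)
```

Here the hypothesis has one substitution (`iy` to `ix`) and one deletion (`ae`), so the distance is 2 over 5 reference phones.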
Install / use
https://catalog.ldc.upenn.edu/LDC93S1 # LDC membership
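Once the corpus is obtained, the phone and word alignments can be read with a few lines of code. The following is a minimal sketch, assuming each alignment line holds a start sample, an end sample, and a label separated by whitespace, and that sample indices refer to 16 kHz audio. The helper names and the sample text are illustrative, and any file path passed to `load_alignment` is hypothetical.

```python
from pathlib import Path
from typing import List, NamedTuple

SAMPLE_RATE_HZ = 16_000  # assumption: alignments index 16 kHz samples


class Segment(NamedTuple):
    start_s: float  # segment start, in seconds
    end_s: float    # segment end, in seconds
    label: str      # phone or word label


def parse_alignment(text: str, sample_rate: int = SAMPLE_RATE_HZ) -> List[Segment]:
    """Parse '<start_sample> <end_sample> <label>' lines into segments."""
    segments = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) < 3:
            continue  # skip blank or malformed lines
        start, end = int(parts[0]), int(parts[1])
        label = " ".join(parts[2:])
        segments.append(Segment(start / sample_rate, end / sample_rate, label))
    return segments


def load_alignment(path: str) -> List[Segment]:
    """Read an alignment file from disk (path supplied by the caller)."""
    return parse_alignment(Path(path).read_text())


# Illustrative alignment text in the assumed three-column layout.
example = "0 3050 h#\n3050 4559 sh\n4559 5723 ix\n"
segments = parse_alignment(example)
```

Dividing by the sample rate converts sample offsets to seconds, which is usually what downstream tooling expects. Decoding the audio itself is out of scope here and depends on the reader you use.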
Features
| Feature | TIMIT |
|---|---|
| Speaker diarization | No |
| Word-level timestamps | Yes |
| Streaming / real-time | No |
| Languages supported | 1 |
| HIPAA eligible | No |
TIMIT vs Whipscribe
| Feature | TIMIT | Whipscribe |
|---|---|---|
| Category | Open source | Transcription APIs |
| Pricing | paid | free beta |
| Speaker diarization | No | Yes |
| Word timestamps | Yes | Yes |
| Streaming | No | No |
| Languages | 1 | 99 |
| Platforms | LDC | Web, API, MCP |
Alternatives to TIMIT
Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or upload a file and you'll have a transcript in seconds.