SPGISpeech
by Kensho Technologies (S&P Global)
5000h of professionally-transcribed earnings-call audio — financial-domain ASR.
TL;DR
5000h of professionally-transcribed earnings-call audio — financial-domain ASR.
Best for domain-adapted ASR for financial/earnings-call transcription; non-native English accents. Pricing: research-only.
Category
Open source
License
—
Stars
—
Last push
—
Pricing
research-only
Platforms
Web
What it is
SPGISpeech is 5000 hours of professional-grade earnings-call transcription from Kensho. Earnings calls feature heavy non-native accents and finance jargon — the canonical financial-domain ASR benchmark. Research-only license.
Best for: Domain-adapted ASR for financial/earnings-call transcription; non-native English accents.
Watch out for: Custom license · NON-COMMERCIAL ONLY (Kensho research license) · registration + agreement required · ~50k speakers across ~16 industries. Cite: O'Neill et al., Interspeech 2021.
Watch out for: Custom license · NON-COMMERCIAL ONLY (Kensho research license) · registration + agreement required · ~50k speakers across ~16 industries. Cite: O'Neill et al., Interspeech 2021.
Install / use
https://datasets.kensho.com/datasets/spgispeech # registration required
Features
| Speaker diarization | No |
| Word-level timestamps | No |
| Streaming / real-time | No |
| Languages supported | 1 |
| HIPAA eligible | No |
SPGISpeech vs Whipscribe
| Feature | SPGISpeech | Whipscribe |
|---|---|---|
| Category | Open source | Transcription APIs |
| Pricing | research-only | free beta |
| Speaker diarization | No | Yes |
| Word timestamps | No | Yes |
| Streaming | No | No |
| Languages | 1 | 99 |
| Platforms | Web | Web, API, MCP |
Alternatives to SPGISpeech
Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.