Yesno (SLR-1)

by OpenSLR

Toy 60-utterance Hebrew corpus — the Kaldi 'hello world' dataset.

TL;DR

Toy 60-utterance Hebrew corpus — the Kaldi 'hello world' dataset.

Best for sanity-checking ASR toolkits (Kaldi yesno recipe); teaching beginners. Pricing: free.

Category
Open source
License
Stars
Last push
Pricing
free
Platforms
OpenSLR

What it is

Yesno (OpenSLR-1) is the toy dataset that every Kaldi tutorial starts with. 60 single-speaker Hebrew utterances of 'ken'/'lo'. Public domain.

Best for: Sanity-checking ASR toolkits (Kaldi yesno recipe); teaching beginners.
Watch out for: Public domain · 60 utterances · Hebrew · single speaker. Cite: bundled with Kaldi.

Install / use

https://www.openslr.org/1/

Features

Speaker diarizationNo
Word-level timestampsNo
Streaming / real-timeNo
Languages supported1
HIPAA eligibleNo

Yesno (SLR-1) vs Whipscribe

FeatureYesno (SLR-1)Whipscribe
CategoryOpen sourceTranscription APIs
Pricingfreefree beta
Speaker diarizationNoYes
Word timestampsNoYes
StreamingNoNo
Languages199
PlatformsOpenSLRWeb, API, MCP

Alternatives to Yesno (SLR-1)

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.