Fluent Speech Commands

by Fluent.ai

30h spoken-language-understanding corpus — intent classification benchmark.

TL;DR

30h spoken-language-understanding corpus — intent classification benchmark.

Best for spoken language understanding (SLU) — action / object / location triplet classification. Pricing: research-only.

Category
Open source
License
Stars
Last push
Pricing
research-only
Platforms
Zenodo

What it is

Fluent Speech Commands is a 30h SLU corpus for smart-home voice commands. Each utterance has an action/object/location intent triplet. License: research-only.

Best for: Spoken language understanding (SLU) — action / object / location triplet classification.
Watch out for: Custom Fluent.ai non-commercial license · 23k utterances · 97 speakers · 31 intent classes. Cite: Lugosch et al., Interspeech 2019.

Install / use

https://zenodo.org/record/5630317  # registration

Features

Speaker diarizationNo
Word-level timestampsNo
Streaming / real-timeNo
Languages supported1
HIPAA eligibleNo

Fluent Speech Commands vs Whipscribe

FeatureFluent Speech CommandsWhipscribe
CategoryOpen sourceTranscription APIs
Pricingresearch-onlyfree beta
Speaker diarizationNoYes
Word timestampsNoYes
StreamingNoNo
Languages199
PlatformsZenodoWeb, API, MCP

Alternatives to Fluent Speech Commands

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.