AI Hub Korea

by NIA (Korean National Information Society Agency)

Korean government open-data hub for speech + NLP corpora — 30+ speech datasets.

TL;DR

Best for Korean speech research; non-Korean researchers can also register. Pricing: research-only.

Category: Open source
License: AI Hub license (review per-dataset terms)
Pricing: research-only
Platforms: Web

What it is

AI Hub Korea hosts 30+ Korean speech corpora (KsponSpeech, broadcast, dialect, child-speech, etc.) under government licensing. Account required.

Best for: Korean speech research; non-Korean researchers can also register.
Watch out for: datasets are released under the AI Hub license, which is non-commercial in many cases, so review the terms of each dataset before use. A Korean account is required. The hub is funded by the Korean government.

Install / use

https://aihub.or.kr/  # Korean account required
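
Once a corpus has been downloaded and extracted, a common first step is to match each audio file to its transcript. The following is a minimal sketch, not an official AI Hub tool. It assumes that each utterance is stored as an audio file and a transcript file sharing the same folder and file stem. The function name `pair_audio_with_transcripts` and the extension sets are hypothetical, and actual layouts vary per dataset, so check the documentation shipped with the corpus you download.

```python
from pathlib import Path

# Assumed extensions (hypothetical); adjust to match the dataset you downloaded.
AUDIO_EXTS = {".wav", ".pcm"}
TEXT_EXTS = {".txt", ".json"}


def pair_audio_with_transcripts(root):
    """Return (audio_path, transcript_path) pairs sharing a folder and file stem."""
    audio, text = {}, {}
    for path in Path(root).rglob("*"):
        if not path.is_file():
            continue
        key = (path.parent, path.stem)
        ext = path.suffix.lower()
        if ext in AUDIO_EXTS:
            audio[key] = path
        elif ext in TEXT_EXTS:
            text[key] = path
    # Keep only utterances that have both an audio file and a transcript.
    return [(audio[k], text[k]) for k in sorted(audio) if k in text]
```

Calling `pair_audio_with_transcripts("path/to/extracted_corpus")` returns a sorted list of path pairs. Audio files without a matching transcript are skipped, which makes incomplete downloads easy to spot by comparing counts.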

Features

Speaker diarization: No
Word-level timestamps: No
Streaming / real-time: No
Languages supported: 3
HIPAA eligible: No

AI Hub Korea vs Whipscribe

Feature              AI Hub Korea    Whipscribe
Category             Open source     Transcription APIs
Pricing              research-only   free beta
Speaker diarization  No              Yes
Word timestamps      No              Yes
Streaming            No              No
Languages            3               99
Platforms            Web             Web, API, MCP

Alternatives to AI Hub Korea

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or upload a file and you'll have a transcript in seconds.