CUHK Cantonese ASR

by CUHK Speech Group

Chinese University of Hong Kong — Cantonese speech research and open checkpoints.

TL;DR

Chinese University of Hong Kong — Cantonese speech research and open checkpoints.

Best for cantonese transcription for Hong Kong and Guangdong-area journalism and broadcast. Pricing: free.

Category
Open source
License
Stars
Last push
Pricing
free
Platforms
Linux

What it is

CUHK's speech group has published Cantonese-tuned ASR checkpoints that handle the language's tonal phonology and code-switching with Mandarin and English more reliably than stock multilingual Whisper. Cantonese remains weakly supported by big-cloud STT, so these releases are the practical baseline for Hong Kong and Guangdong-area transcription projects. Best fit when the buyer is cantonese transcription for hong kong and guangdong-area journalism and broadcast. The honest caveat: research-grade releases; production hardening is integrator's responsibility. As with any open-weights release, the integrator owns hosting, scaling, and SLA — but the licensing cost is zero and the model can be fine-tuned on in-house audio.

Best for: Cantonese transcription for Hong Kong and Guangdong-area journalism and broadcast.
Watch out for: Research-grade releases; production hardening is integrator's responsibility.

Install / use

huggingface.co search 'cantonese asr' for model cards

Features

Speaker diarizationNo
Word-level timestampsYes
Streaming / real-timeNo
Languages supported1
HIPAA eligibleNo

CUHK Cantonese ASR vs Whipscribe

FeatureCUHK Cantonese ASRWhipscribe
CategoryOpen sourceTranscription APIs
Pricingfreefree beta
Speaker diarizationNoYes
Word timestampsYesYes
StreamingNoNo
Languages199
PlatformsLinuxWeb, API, MCP

Alternatives to CUHK Cantonese ASR

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.