VinAI / VinBigData ASR

by VinAI Research

VinAI Research — Vietnamese-language ASR and speech research from the Vingroup AI arm.

TL;DR

VinAI Research — Vietnamese-language ASR and speech research from the Vingroup AI arm.

Best for vietnamese-language transcription where the commercial cloud STT engines are weak. Pricing: free.

Category
Open source
License
Stars
Last push
Pricing
free
Platforms
Linux, macOS

What it is

VinAI Research, the AI arm of Vingroup, publishes a steady stream of Vietnamese-language NLP and speech models on Hugging Face. The released checkpoints (including PhoWhisper variants tuned on Vietnamese) materially outperform stock Whisper on Vietnamese audio and are a load-bearing input for many Vietnamese-language ASR products. Best fit when the buyer is vietnamese-language transcription where the commercial cloud stt engines are weak. The honest caveat: research checkpoints; productionisation is the integrator's responsibility. As with any open-weights release, the integrator owns hosting, scaling, and SLA — but the licensing cost is zero and the model can be fine-tuned on in-house audio.

Best for: Vietnamese-language transcription where the commercial cloud STT engines are weak.
Watch out for: Research checkpoints; productionisation is the integrator's responsibility.

Install / use

vinai.io — Vietnamese ASR checkpoints on Hugging Face

Features

Speaker diarizationNo
Word-level timestampsYes
Streaming / real-timeNo
Languages supportedNone
HIPAA eligibleNo

VinAI / VinBigData ASR vs Whipscribe

FeatureVinAI / VinBigData ASRWhipscribe
CategoryOpen sourceTranscription APIs
Pricingfreefree beta
Speaker diarizationNoYes
Word timestampsYesYes
StreamingNoNo
Languages99
PlatformsLinux, macOSWeb, API, MCP

Alternatives to VinAI / VinBigData ASR

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.