AI4Bharat IndicConformer

by AI4Bharat / IIT Madras

Open-source Indic ASR models from IIT Madras' AI4Bharat lab — 22 scheduled Indian languages.

TL;DR

Open-source Indic ASR models from IIT Madras' AI4Bharat lab — 22 scheduled Indian languages.

Best for researchers and Indian-language product teams who need open weights for fine-tuning. Pricing: free.

Category
Open source
License
Stars
Last push
Pricing
free
Platforms
Linux, macOS, Windows

What it is

IndicConformer is AI4Bharat's open-source ASR model family targeting India's 22 scheduled languages. Released under permissive licenses with weights on Hugging Face, the models are the natural starting point for any team that needs to self-host Indic STT rather than rely on a commercial API. The same lab also publishes IndicTrans and IndicBERT, so the broader pipeline is reachable. Best fit when the buyer is researchers and indian-language product teams who need open weights for fine-tuning. The honest caveat: conformer architecture is non-streaming by default; english is not in scope. As with any open-weights release, the integrator owns hosting, scaling, and SLA — but the licensing cost is zero and the model can be fine-tuned on in-house audio.

Best for: Researchers and Indian-language product teams who need open weights for fine-tuning.
Watch out for: Conformer architecture is non-streaming by default; English is not in scope.

Install / use

ai4bharat.iitm.ac.in — model cards on Hugging Face under ai4bharat/

Features

Speaker diarizationNo
Word-level timestampsYes
Streaming / real-timeNo
Languages supported22
HIPAA eligibleNo

AI4Bharat IndicConformer vs Whipscribe

FeatureAI4Bharat IndicConformerWhipscribe
CategoryOpen sourceTranscription APIs
Pricingfreefree beta
Speaker diarizationNoYes
Word timestampsYesYes
StreamingNoNo
Languages2299
PlatformsLinux, macOS, WindowsWeb, API, MCP

Alternatives to AI4Bharat IndicConformer

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.