MLX Data

by Apple

Audio + image data loaders for MLX training.

TL;DR

Audio + image data loaders for MLX training.

Best for building Whisper-fine-tune pipelines on Apple Silicon. Pricing: free.

Category
Open source
License
Stars
Last push
Pricing
free
Platforms
macOS

What it is

Data-loading library to feed MLX models efficiently. MIT.

Best for: Building Whisper-fine-tune pipelines on Apple Silicon.
Watch out for: Newer than torch-audio; smaller dataset coverage.

Install / use

pip install mlx-data

Features

Speaker diarizationNo
Word-level timestampsYes
Streaming / real-timeNo
Languages supported99
HIPAA eligibleNo

MLX Data vs Whipscribe

FeatureMLX DataWhipscribe
CategoryOpen sourceTranscription APIs
Pricingfreefree beta
Speaker diarizationNoYes
Word timestampsYesYes
StreamingNoNo
Languages9999
PlatformsmacOSWeb, API, MCP

Alternatives to MLX Data

Whipscribe is a managed faster-whisper + whisperX service. If you want transcripts without running infrastructure, paste a URL or drop a file in the form below — you'll have a transcript in seconds.