🗓️ Release Week Recap
Big week. mlx-audio and mlx-vlm are now among some of the fastest-growing OSS projects. Here’s what we shipped last week.
Gemma 4 on Apple Silicon
Two awesome releases by our partner
@GoogleDeepMind &
@googlegemma :
> Gemma 4 12B — their new dense, unified multimodal model. It uses an encoder free audio path and simplified vision encoder.
> Gemma 4 QAT — quantization-aware training checkpoints, optimized to run locally on consumer GPUs and edge devices, compressing the model while preserving the quality you expect from Gemma 4.
On the audio 🎧 side we added support for 15 new TTS, ASR & VAD models, faster long-form transcription, and an expanded OpenAI-compatible audio server. All local on Apple Silicon.
Huge thanks to every contributor and my co-maintainer
@lllucas. 🙏🏽