🚀 VoxCPM 2 is live!
🎉 Another open-source AI
#TTS model from China — and one that stands shoulder to shoulder with Qwen3-TTS, while bringing everything into a single unified model. After rapid iterations from V1 (zero-shot cloning) to V1.5 (long-form fine-tuning),
#VoxCPM has consistently pushed quality and usability forward.
Now, VoxCPM 2 takes it further:
🔹30 languages — truly global, truly local.
🔹Infinite voice design — type it, hear it, control it. From a whisper to a booming cinematic voice.
🔹Studio-grade audio — 48kHz ultra-high fidelity with emotional depth
🔹Diffusion-Autoregressive cloning — preserves more acoustic and emotional detail than token-based models like Qwen3-TTS
💡 Big shoutout to
@grok — used your multi-image video magic for our launch demo. It’s scarily good at keeping visuals consistent across shots. Elon
@elonmusk, this one’s for you. 😉
Check the demo & start cloning your dream voice:
🌐 Hugging Face Space:
huggingface.co/spaces/openbm…
🤗 Hugging Face Model:
huggingface.openbmb.com/mode…
🤖 ModelScope Model:
modelscope.cn/models/OpenBMB…
💻 GitHub:
github.com/OpenBMB/VoxCPM/
#TTS #AI #VoiceCloning #GrokImagine #ElonMusk #OpenBMB #VoxCPM