VoxCPM 2 is live!
Another open-source AI #TTS model from China, and one that stands shoulder to shoulder with Qwen3-TTS while bringing everything into a single unified model. After rapid iterations from V1 (zero-shot cloning) to V1.5 (long-form fine-tuning), #VoxCPM has consistently pushed quality and usability forward.
Now, VoxCPM 2 takes it further:
🔹 30 languages: truly global, truly local.
🔹 Infinite voice design: type it, hear it, control it. From a whisper to a booming cinematic voice.
🔹 Studio-grade audio: 48kHz ultra-high fidelity with emotional depth.
🔹 Diffusion-Autoregressive cloning: preserves more acoustic and emotional detail than token-based models like Qwen3-TTS.
💡 Big shoutout to @grok: we used your multi-image video magic for our launch demo. It's scarily good at keeping visuals consistent across shots. Elon @elonmusk, this one's for you.
Check the demo & start cloning your dream voice:
Hugging Face Space: huggingface.co/spaces/openbm…
Hugging Face Model: huggingface.openbmb.com/mode…
ModelScope Model: modelscope.cn/models/OpenBMB…
GitHub: github.com/OpenBMB/VoxCPM/
#TTS #AI #VoiceCloning #GrokImagine #ElonMusk #OpenBMB #VoxCPM