Did xAI just mass-murder the entire voice AI industry? 🤯
Grok just launched two voice APIs. Speech-to-Text and Text-to-Speech.
Built on the same stack powering Tesla cars and Starlink support.
And priced at 10x cheaper than ElevenLabs.
Speech-to-Text: $0.10/hr batch. $0.20/hr streaming.
Text-to-Speech: $4.20 per million characters.
25 languages. Real-time streaming. Speaker diarization.
Already outperforming ElevenLabs, Deepgram, and AssemblyAI on word error rate.
TTS ships with expressive tags like [laugh], [sigh], <whisper>, <emphasis>.
Voices that don't sound like robots reading a script.
ElevenLabs spent years building a voice AI company.
xAI built voice AI for cars and satellites.