Fascinating. Google just released Gemini 3.5 Live Translate.
It is a live speech-to-speech translation model that starts speaking in another language while the original speaker is still talking.
Older translation systems often wait for a full sentence, because early words can be misleading until later words reveal tense, intent, or context.
Gemini 3.5 Live Translate instead runs streaming translation, where the model listens, interprets partial meaning, predicts what can safely be translated, and keeps updating as new speech arrives.
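To make the idea concrete, here is a minimal sketch of a "wait-k" policy, a well-known approach to simultaneous translation from the research literature. It is not a description of how Gemini 3.5 Live Translate works internally, which the announcement does not detail. The names `wait_k_stream`, `translate_prefix`, and `toy_translate` are hypothetical, and the toy translator assumes one output word per input word, which real languages do not guarantee.

```python
from typing import Callable, Dict, Iterable, Iterator, List


def wait_k_stream(
    source: Iterable[str],
    translate_prefix: Callable[[List[str], int], str],
    k: int = 2,
) -> Iterator[str]:
    """Yield target words while still reading source words, lagging by k - 1 words."""
    seen: List[str] = []
    emitted = 0
    for word in source:
        seen.append(word)
        # Once k source words are buffered, emit one target word per new source word.
        if len(seen) >= k:
            yield translate_prefix(seen, emitted)
            emitted += 1
    # The speaker has stopped: flush whatever has not been translated yet.
    while emitted < len(seen):
        yield translate_prefix(seen, emitted)
        emitted += 1


# Hypothetical word-for-word lexicon, standing in for a real translation model.
LEXICON: Dict[str, str] = {"hola": "hello", "mundo": "world", "amigo": "friend"}


def toy_translate(prefix: List[str], index: int) -> str:
    """Translate the word at `index`; a real model would use the whole prefix."""
    return LEXICON[prefix[index]]


if __name__ == "__main__":
    for target_word in wait_k_stream(["hola", "mundo", "amigo"], toy_translate, k=2):
        print(target_word)
```

A larger `k` gives the translator more context before it commits to a word, at the cost of more delay. That is the same speed versus quality trade-off the post describes.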
It supports over 70 languages, stays only a few seconds behind the speaker, and can preserve pacing, pitch, and intonation across longer sessions.
It is rolling out through the Gemini Live API, to businesses through a Google Meet preview, and to regular users through Google Translate on Android and iOS.
Today, we released Gemini 3.5 Live Translate, our latest audio model for live speech-to-speech translation.
It supports over 70 languages and starts translating as soon as you start talking, streaming translations while listening to what you say next. No awkward pauses or choppy audio, just real connection without language barriers.
So, how does it work? 🤔
The model makes split-second decisions that balance speed against translation quality, so conversations feel fluid, human, and natural. To do this, it must receive and contextualize the input while simultaneously outputting the translated speech.
Through this process, Gemini 3.5 Live Translate manages to stay mere seconds behind each speaker and can even maintain pacing, pitch, and intonation across extended sessions.
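The "receive while simultaneously outputting" part can be pictured as two tasks sharing a queue. The sketch below uses Python's standard `asyncio` library, with text chunks standing in for audio. Everything in it is an assumption made for illustration: `listen`, `speak`, and `run_session` are hypothetical names, uppercasing stands in for translation, and none of it reflects the actual Gemini Live API.

```python
import asyncio
from typing import List, Optional


async def listen(chunks: List[str], inbox: "asyncio.Queue[Optional[str]]") -> None:
    """Push incoming speech chunks onto the queue as they 'arrive'."""
    for chunk in chunks:
        await inbox.put(chunk)
        await asyncio.sleep(0)  # hand control to the speaker task
    await inbox.put(None)  # sentinel: the speaker has finished


async def speak(inbox: "asyncio.Queue[Optional[str]]", spoken: List[str]) -> None:
    """Consume chunks as soon as they are available and 'speak' a translation."""
    while True:
        chunk = await inbox.get()
        if chunk is None:
            break
        spoken.append(chunk.upper())  # placeholder for real translation


async def run_session(chunks: List[str]) -> List[str]:
    """Run the listening and speaking tasks concurrently over one session."""
    inbox: "asyncio.Queue[Optional[str]]" = asyncio.Queue()
    spoken: List[str] = []
    await asyncio.gather(listen(chunks, inbox), speak(inbox, spoken))
    return spoken


if __name__ == "__main__":
    print(asyncio.run(run_session(["good morning", "how are you"])))
```

Because neither task waits for the other to finish, output can begin before the input ends, which is what keeps the delay down to a short lag instead of a full sentence.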
Try it yourself in the Google Translate app on iOS and Android.