🎙️ Google updated Gemini 2.5 Flash Native Audio for live voice agents, and it added live speech to speech translation to the Translate app.
The update focuses on
- better tool calling,
- more accurate instruction following, and
- smoother and more cohesive multi turn conversation quality.
Tool calling is more reliable, it knows when to fetch live info, and it scores 71.5% on ComplexFuncBench Audio.
Instruction following improved to 90% adherence, up from 84%, so the agent breaks rules less often.
Multi turn quality improved because it uses earlier context better across longer back and forth chats.
Gemini 2.5 Flash Native Audio is available in Google AI Studio and Vertex AI, and it is rolling out into Gemini Live and Search Live.
Live speech translation can keep listening into 1 target language, or run a 2 way chat that switches languages by speaker.
Translation keeps tone, pacing, and pitch, and it supports 70 languages and about 2,000 language pairs with auto detection and noise filtering.
The Translate beta is rolling out on Android in the US, Mexico, and India, with iOS and more regions later, and an API release planned for 2026.