<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Gemini Team</title>
    <description>The latest articles on DEV Community by Gemini Team (@geminiteam).</description>
    <link>https://dev.to/geminiteam</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F4002713%2Fe8ddaf55-395d-4449-92f4-64b8cb08b1eb.png</url>
      <title>DEV Community: Gemini Team</title>
      <link>https://dev.to/geminiteam</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/geminiteam"/>
    <language>en</language>
    <item>
      <title>Fluid, natural voice translation with Gemini 3.5 Live Translate</title>
      <dc:creator>Gemini Team</dc:creator>
      <pubDate>Tue, 30 Jun 2026 15:54:14 +0000</pubDate>
      <link>https://dev.to/googleai/fluid-natural-voice-translation-with-gemini-35-live-translate-27n9</link>
      <guid>https://dev.to/googleai/fluid-natural-voice-translation-with-gemini-35-live-translate-27n9</guid>
      <description>&lt;p&gt;Twenty years ago, &lt;a href="https://blog.google/products-and-platforms/products/translate/fun-facts-google-translate-20-years/" rel="noopener noreferrer"&gt;translation at Google&lt;/a&gt; began as one of our pioneering machine learning experiments to turn the science of language into the magic of human connection. That experiment has come a long way, with over a trillion words now translated for billions of users across our products every month.&lt;/p&gt;

&lt;p&gt;We’re taking our next step with the release of Gemini 3.5 Live Translate, our latest audio model for live speech-to-speech translation.&lt;/p&gt;

&lt;p&gt;The model automatically detects 70+ languages and generates smooth, natural-sounding translated speech that preserves the speaker's intonation, pacing and pitch. Unlike turn-by-turn systems that wait for the speaker to finish before responding, 3.5 Live Translate generates speech continuously, balancing the trade-off between waiting for context to improve quality and translating immediately to stay in sync with the speaker. It delivers fluid audio without awkward pauses and stays just a few seconds behind the speaker throughout the session.&lt;/p&gt;

&lt;p&gt;Gemini 3.5 Live Translate is rolling out across Google products:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;For developers in public preview via the &lt;a href="https://ai.google.dev/gemini-api/docs/live-api/live-translate" rel="noopener noreferrer"&gt;Gemini Live API&lt;/a&gt; and &lt;a href="https://aistudio.google.com/live?model=gemini-3.5-live-translate-preview" rel="noopener noreferrer"&gt;Google AI Studio&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;For enterprises in private preview starting this month in &lt;a href="https://workspace.google.com/products/meet/" rel="noopener noreferrer"&gt;Google Meet&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;For everyone via Google Translate on &lt;a href="https://play.google.com/store/apps/details?id=com.google.android.apps.translate&amp;amp;26hl=en" rel="noopener noreferrer"&gt;Android&lt;/a&gt; and &lt;a href="https://apps.apple.com/us/app/google-translate/id414706506" rel="noopener noreferrer"&gt;iOS&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;Build with 3.5 Live Translate&lt;/h2&gt;

&lt;p&gt;Gemini 3.5 Live Translate processes speech as it’s streamed, enabling a more seamless connection across languages. The model handles multilingual input without manual configuration, and its noise robustness helps applications cope with loud, unpredictable environments. You can use it to provide live interpretation for multilingual calls, meetings, lessons, broadcasts and more.&lt;/p&gt;

&lt;p&gt;  &lt;iframe src="https://www.youtube.com/embed/TNwKs39uSVk"&gt;
  &lt;/iframe&gt;
&lt;/p&gt;

&lt;p&gt;Watch the Gemini Live API in action, enabling dubbing and simultaneous multi-language translation. Explore the &lt;a href="https://github.com/google-gemini/gemini-live-api-examples/tree/main/gemini-live-translate-livekit" rel="noopener noreferrer"&gt;demo&lt;/a&gt; or browse more &lt;a href="https://github.com/google-gemini/gemini-live-api-examples" rel="noopener noreferrer"&gt;example code&lt;/a&gt; in the Gemini Cookbook.&lt;/p&gt;

&lt;p&gt;Developer platforms like &lt;a href="https://docs.agora.io/en/conversational-ai/models/mllm/gemini" rel="noopener noreferrer"&gt;Agora&lt;/a&gt;, &lt;a href="https://docs.fishjam.io/tutorials/gemini-live-integration" rel="noopener noreferrer"&gt;Fishjam&lt;/a&gt;, &lt;a href="https://docs.livekit.io/agents/models/realtime/plugins/gemini/" rel="noopener noreferrer"&gt;LiveKit&lt;/a&gt;, &lt;a href="https://docs.pipecat.ai/guides/features/gemini-live" rel="noopener noreferrer"&gt;Pipecat&lt;/a&gt;, and &lt;a href="https://visionagents.ai/integrations/gemini" rel="noopener noreferrer"&gt;Vision Agents&lt;/a&gt; integrate the Gemini Live API so developers can build and deploy voice translation apps more easily. These integrations handle the complex real-time media streaming infrastructure, so developers can focus on the user experience.&lt;/p&gt;

&lt;p&gt;Our partners at Grab are testing the model to enable near-real-time multilingual communication between drivers and travelers at pickups. These users make over 10 million voice calls per month through Grab.&lt;br&gt;
&amp;nbsp;&lt;br&gt;
  &lt;iframe src="https://www.youtube.com/embed/16Y2DU6LJX4"&gt;
  &lt;/iframe&gt;
&lt;/p&gt;

&lt;center&gt;&lt;small&gt;See how Grab has been testing 3.5 Live Translate to transform communication between users.&lt;/small&gt;&lt;/center&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;h2&gt;Read the early reviews&lt;/h2&gt;

&lt;p&gt;In addition to Grab, companies like CJ ENM, LiveKit and others have shared positive feedback on 3.5 Live Translate, highlighting its translation quality, accuracy and low latency:&lt;/p&gt;


&lt;div class="crayons-card c-embed"&gt;

  &lt;br&gt;
"While testing Gemini 3.5 Live Translate, we’ve valued its ability to auto-detect multiple languages and translate speech accurately with low latency."&lt;br&gt;
&lt;strong&gt;Philipp Kandal&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Chief Product Officer at Grab&lt;/em&gt;&lt;br&gt;

&lt;/div&gt;



&lt;div class="crayons-card c-embed"&gt;

  &lt;br&gt;
"CJ ENM is excited to partner with Google DeepMind on 3.5 Live Translate. Early tests show promising quality for a more authentic experience for global &amp;amp; Korean viewers."&lt;br&gt;
&lt;strong&gt;Bella Baek&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Chief AI Officer at CJ ENM&lt;/em&gt;&lt;br&gt;

&lt;/div&gt;



&lt;div class="crayons-card c-embed"&gt;

  &lt;br&gt;
"Gemini 3.5 Live Translate makes multilingual voice effortless. I built a demo on LiveKit Agents where everyone speaks their own language and understands each other live."&lt;br&gt;
&lt;strong&gt;Jesse Hall&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Staff Developer Advocate at LiveKit&lt;/em&gt;&lt;br&gt;

&lt;/div&gt;



&lt;div class="crayons-card c-embed"&gt;

  &lt;br&gt;
"During our time with the 3.5 Live Translate model, we tested across several languages, and our team was blown away by the speed, accuracy, and liveliness of the model."&lt;br&gt;
&lt;strong&gt;Nash Ramdial&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Director at Vision Agents&lt;/em&gt;&lt;br&gt;

&lt;/div&gt;



&lt;div class="crayons-card c-embed"&gt;

  &lt;br&gt;
"Gemini 3.5 Live Translate paired with Fishjam’s MoQ protocol sets a new frontier for real-time multimedia streaming, allowing speech-to-speech translation into over 70 languages."&lt;br&gt;
&lt;strong&gt;Maciej Rys&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;VP of Engineering at Software Mansion&lt;/em&gt;&lt;br&gt;

&lt;/div&gt;



&lt;div class="crayons-card c-embed"&gt;

  &lt;br&gt;
"We tested the Gemini 3.5 Live Translate model at Agora and in our opinion it provided SOTA results, with low latency and high accuracy that set a new bar for real-time translation."&lt;br&gt;
&lt;strong&gt;Mason Adams&lt;/strong&gt;&lt;br&gt;
&lt;em&gt;Developer Evangelist at Agora&lt;/em&gt;&lt;br&gt;

&lt;/div&gt;


&lt;h2&gt;Experience 3.5 Live Translate in your video meetings&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://support.google.com/meet/answer/16221730?hl=en" rel="noopener noreferrer"&gt;Speech translation&lt;/a&gt; in Google Meet will soon use 3.5 Live Translate, improving the experience by:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Offering 70+ languages, up from the previous limit of five languages,&lt;/li&gt;
&lt;li&gt;Enabling conversations across 2,000+ language combinations in one meeting, where previously translation was only to and from English,&lt;/li&gt;
&lt;li&gt;Updating the interface to provide instant access to speech translation.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;We’re launching this update in private preview for select business Google Workspace customers starting this month, followed by a broader rollout later this year.&lt;br&gt;
&amp;nbsp;&lt;br&gt;
  &lt;iframe src="https://www.youtube.com/embed/DLSLKCqahyI"&gt;
  &lt;/iframe&gt;
&lt;/p&gt;

&lt;center&gt;&lt;small&gt;Google Meet participants use speech translation to communicate across English, Mandarin, and Swedish.&lt;/small&gt;&lt;/center&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;h2&gt;Get 3.5 Live Translate in the Google Translate app on Android or iOS&lt;/h2&gt;

&lt;p&gt;The model is also rolling out on the Google Translate app globally, on both &lt;a href="https://play.google.com/store/apps/details?id=com.google.android.apps.translate" rel="noopener noreferrer"&gt;Android&lt;/a&gt; and &lt;a href="https://apps.apple.com/us/app/google-translate/id414706506" rel="noopener noreferrer"&gt;iOS&lt;/a&gt;. When using the Live translate feature, simply connect any pair of headphones to experience a more seamless translation that mirrors the speaker’s tone across 70+ languages.&lt;/p&gt;

&lt;p&gt;For Android users, we’re also starting to roll out a new ‘listening mode’ with 3.5 Live Translate that lets you hear translations directly through your phone’s earpiece. Simply hold your phone to your ear just like a regular call, and the translated audio streams straight to you. This new experience can be helpful in situations where you want to quickly hear translations without others hearing, and you don’t have your headphones handy.&lt;br&gt;
&amp;nbsp;&lt;/p&gt;


  


&lt;center&gt;&lt;small&gt;Using the new listening mode, users can hear a near real-time English translation of a guided tour in Spanish directly through their phone's earpiece.&lt;/small&gt;&lt;/center&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;h2&gt;Watermarked with SynthID&lt;/h2&gt;

&lt;p&gt;All audio generated by our models is watermarked with SynthID. This imperceptible watermark is woven directly into the audio output, ensuring AI-generated content remains detectable to help prevent misinformation. For details on our approach to safety and responsibility, review the &lt;a href="https://deepmind.google/models/model-cards/gemini-3-5-audio/" rel="noopener noreferrer"&gt;model card&lt;/a&gt;.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>google</category>
      <category>gemini</category>
      <category>machinelearning</category>
    </item>
  </channel>
</rss>
