Google Launches Gemini 3.5 Live Translate Across Its Services


Introduction

Google has introduced Gemini 3.5 Live Translate, a new AI model designed for near-real-time speech-to-speech communication across many languages.

Main Body

The development of Gemini 3.5 Live Translate marks a shift from relying on specific hardware to a more flexible software approach. In the past, real-time translation required specific Google devices, such as Pixel Buds. However, the new version works with various earbud brands, or through a special 'listening mode' on Android devices, in which the phone is held to the ear to hear translations.

Technologically, the model is different because it uses continuous streaming instead of waiting for a speaker to finish a sentence. This system can automatically detect over 70 languages and reduces delays to only a few seconds. Furthermore, the system is designed to work in noisy environments by filtering out background sounds. To make the translation sound more natural, the AI tries to copy the speaker's original tone, speed, and pitch, which makes the voice sound less like a machine.
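The difference between continuous streaming and waiting for the end of a sentence can be shown with a small Python sketch. This is only an illustration, not Google's implementation: the toy dictionary, the function names, and the simulated microphone are all assumptions made up for this example.

```python
from typing import Iterable, Iterator

# Toy word-for-word "translator": a stand-in for a real model (assumption).
TOY_DICTIONARY = {"hola": "hello", "mundo": "world", "amigos": "friends"}


def translate_word(word: str) -> str:
    """Hypothetical helper: look a word up, leaving unknown words unchanged."""
    return TOY_DICTIONARY.get(word, word)


def translate_after_sentence(words: Iterable[str]) -> list[str]:
    """Buffered approach: wait for the whole sentence, then translate it."""
    buffered = list(words)  # nothing is produced until the speaker finishes
    return [translate_word(w) for w in buffered]


def translate_streaming(words: Iterable[str]) -> Iterator[str]:
    """Streaming approach: emit each translated piece as soon as it arrives."""
    for word in words:
        yield translate_word(word)


def microphone() -> Iterator[str]:
    """Simulated speech arriving one word at a time."""
    yield from ["hola", "mundo", "amigos"]


if __name__ == "__main__":
    # Each translated word is printed before the next word has been "spoken".
    for piece in translate_streaming(microphone()):
        print(piece)
```

Both functions produce the same words in the end. The streaming version simply hands over each piece earlier, which is why the listener hears less delay.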

Regarding its release, Google is launching the tool in stages. Developers can already access the model through the Gemini Live API or AI Studio. Additionally, business clients will see the tool integrated into Google Meet starting this month, followed by a general release in the Google Translate app for iOS and Android. To prevent the misuse of AI-generated audio, Google has added SynthID watermarks to all audio streams so that the AI speech can be easily identified.

Conclusion

Gemini 3.5 Live Translate is currently available to developers and some business users, while a wider release for the general public is expected soon.

Vocabulary Learning

🚀 The "B2 Leap": Moving from Simple to Sophisticated Descriptions

At an A2 level, you describe things using simple verbs like "is," "has," or "works." To reach B2, you need to describe processes and changes.

Look at this sentence from the text:

"The development of Gemini 3.5 Live Translate marks a shift from relying on specific hardware to a more flexible software approach."

💡 The Magic Phrase: "Marks a shift from X to Y"

Instead of saying "Things are different now," a B2 speaker says "This marks a shift from [Old Way] to [New Way]." It describes a transition perfectly.

Compare these two styles:

  • A2 (Basic): Google had special earbuds. Now, any earbuds work. It is a change.
  • B2 (Advanced): This new update marks a shift from using specific Google hardware to a more flexible software approach.

🛠️ Expanding Your "Connectors"

B2 speakers don't just use "and" or "but." They use words that guide the reader. In the article, we see:

  1. "Furthermore" → Use this instead of "Also" when adding a strong new point.
  2. "Regarding..." → Use this to introduce a new topic (e.g., "Regarding the price, it is free.")
  3. "Instead of" → Use this to contrast two methods (e.g., "It streams audio instead of waiting for the end of the sentence.")

🎓 Pro-Tip: The "Less Like" Pattern

Notice how the author describes the voice:

"...which makes the voice sound less like a machine."

To sound more natural in English, stop using "It is not a machine" (which is too simple). Use "less like [Noun]" to describe a quality that is improving.

  • Example: "With practice, your English sounds less like a textbook and more like a native speaker!"

Vocabulary Learning

flexible (adj.)
Able to change or be adapted easily to different circumstances.
Example: The company adopted a flexible working schedule to help employees balance their home lives.

continuous (adj.)
Forming an unbroken whole; without interruption.
Example: The continuous noise from the construction site made it difficult to concentrate.

detect (v.)
To discover or identify the presence or existence of something.
Example: The new security system can detect motion even in complete darkness.

filter out (v. phr.)
To remove something unwanted from a group or a signal.
Example: These noise-canceling headphones filter out the sound of the airplane engine.

integrated (adj.)
Combined into a single unit or system.
Example: The new software provides an integrated platform for managing all project tasks.

misuse (n.)
The act of using something in the wrong way or for the wrong purpose.
Example: The government passed new laws to prevent the misuse of personal data.

identified (v.)
Recognized or established who or what someone or something is.
Example: The witness successfully identified the suspect in a police lineup.