Google's AI Now Translates Your Speech In Your Exact Voice

Google's AI translator directly converts audio translations and keeps your voice and tone intact .

At some point or another audio translations have had to be used and in those times the distinction between the voice of the translation and the original one is highly noticeable. The most obvious change is the swap from a male voice to a female one, or vice versa.

Google's translation team has been working hard to minimize the audio changes, and its audio translator can now keep the voice and tone as close as possible to that of the original speaker.

RELATED: GOOGLE AIMS TO HELP PEOPLE WITH IMPAIRED SPEECH TO LIVE INDEPENDENTLY

There are still some noticeable, yet distinctly smaller, differences. These have been dramatically minimized in comparison to other translation engines.

How does it all work?

Google's AI translator directly converts the audio input to the audio output without any further in-between steps.

Traditionally speaking, translation systems convert audio into text, the text is then translated, and finally, the audio is resynthesized. Somewhere in the middle, the original voice is lost and a new, distinctly different, one is used in its stead.

What Google has done is to create and use a new system, named the 'Translatotron', an end-to-end speech-to-speech translation system. The Translatotron comprises three steps:

Audio spectrograms from input languages into output ones trained to map each other.
A conversion of spectrograms into an audio wave.
The third component layers the original speaker's voice back onto the final output.

What difference will this make?

This is a positive tick in the box for all matters linked to audio translation, not only due to the fact it creates more nuanced translations but because it also minimizes room for errors. As there are fewer steps in the translation process, there are fewer chances for mistakes to happen.

Kaynak: https://interestingengineering.com/googles-ai-now-translates-your-speech-in-your-exact-voice

1 Comment

emmyjohn09654

Dec 24, 2024

Google’s 'Translatotron' improves audio translations by preserving the speaker's voice, reducing errors. This end-to-end system converts audio to text and back, ensuring more accurate and natural results.For businesses, text blast bulk sms services can help reach a large audience efficiently, complementing such advances in translation technology.

Google's AI Now Translates Your Speech In Your Exact Voice

How does it all work?

What difference will this make?

Tekno-Forum

Special Offers