
Google DeepMind develops most realistic-sounding AI yet

While subtle, one of the biggest advances portrayed in movies like Her or Ex Machina was that the AI began to really sound like a fellow human. And in the realm of real-life tech, Google’s AI focus in recent years has similarly been to make computers sound more like us. And they’re getting much better at it.

The latest development to come out of Google’s DeepMind is called WaveNet. It samples different parts of human speech and models its own waveforms after the way they sound. It’s not perfect yet, but we’re definitely getting closer to voices that sound like they come from a person’s mouth, rather than from a computer’s speaker.

While it still sounds strange, the new AI speech certainly flows better than the kinds of responses you’ll get from Siri or Cortana. Those assistants chop up recorded human speech and paste it back together, which gets individual pronunciations right but leaves the overall flow of the speech completely off. (That technique is known as concatenative text to speech, just so you know.)

The WaveNet option flows much better because it uses something called parametric text to speech, which generates the audio from scratch rather than stitching recordings together. Where it differs from traditional uses of that technique, though, is that Google’s AI models its audio on the waveforms of real human voices.

That’s difficult, because there are typically around 16,000 audio samples to generate for every second of speech, and that takes a lot of processing power to handle. To cut back on that, WaveNet uses a prediction engine to estimate what sample should come next in natural speech, using everything that has gone before as a guide.
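To make the sample-by-sample idea in the paragraph above concrete, here is a minimal Python sketch of that kind of prediction loop. It is not WaveNet: the `predict_next` function is a hypothetical stand-in that uses a simple sine-wave formula where the real system would use a trained neural network, and the 440 Hz tone and the function names are assumptions made purely for illustration. The only figure taken from the article is the rate of 16,000 samples per second.

```python
import math

SAMPLE_RATE = 16_000  # samples per second, the figure quoted above


def predict_next(history, freq_hz=440.0):
    """Toy stand-in for a learned model: predict the next sample from the past.

    Uses the exact two-term recurrence for a sine wave. A real system would
    replace this with a trained network that looks at far more history.
    """
    w = 2.0 * math.pi * freq_hz / SAMPLE_RATE
    return 2.0 * math.cos(w) * history[-1] - history[-2]


def generate(seed, n_samples, predictor=predict_next):
    """Prediction loop: every new sample is appended and fed back in."""
    samples = list(seed)
    while len(samples) < n_samples:
        samples.append(predictor(samples))
    return samples


# Seed with the first two samples of a 440 Hz tone, then fill one second.
w = 2.0 * math.pi * 440.0 / SAMPLE_RATE
one_second = generate([0.0, math.sin(w)], SAMPLE_RATE)
print(len(one_second))  # number of samples in one second of audio
```

Even in this toy version, one second of sound means running the predictor thousands of times in a row, with each step waiting on the one before it. That gives a feel for why this style of generation is demanding.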

The results are impressive. To give you a comparison, here’s a classic concatenative text-to-speech system:

That sounds like the sort of digital assistant voice we’ve become used to in recent years. But here’s the new WaveNet system that Google has developed:

The cadence of the speech is much more realistic, and though there is a general fuzziness to the audio, it’s not hard to imagine that being cleaned up with further development.

The process can even be used to simulate different kinds of voices, for example, male and female:

The only problem now is that even though its predictions reduce the amount of processing this technique requires, it still demands too much for standard smartphone hardware to do it in real time. At least for now.

For more information on these techniques, Google’s blog post offers a lot more detail and samples, and the company has also posted a couple of papers on the work.

Jon Martindale