Apple fixes Siri’s robotic-sounding voice with deep learning – hear the difference

Thursday, August 24, 2017 10:27 amThursday, August 24, 201721 Comments

News

“Ahead of the launch of iOS 11 this fall, Apple has published a research paper detailing its methods for improving Siri to make the voice assistant sound more natural, with the help of machine learning,” Abhimanyu Ghoshal reports for TNW.

“Beyond capturing several hours of high-quality audio that can be sliced and diced to create voice responses, developers face the challenge of getting the prosody – the patterns of stress and intonation in spoken language – just right,” Ghoshal reports. “That’s where machine learning comes in. With enough training data, it can help a text-to-speech system understand how to select segments of audio that pair well together to create natural-sounding responses.”

“The results speak for themselves (ba dum tiss): Siri’s navigation instructions, responses to trivia questions and ‘request completed’ notifications sound a lot less robotic than they did two years ago,” Ghoshal reports. “You can hear them for yourself at the end of this paper from Apple.”

Read more in the full article here.

MacDailyNews Take: Listen to the iOS 9 and iOS 10 samples vs. the new iOS 11 samples. The new Siri voice is a vast improvement!

SEE ALSO:
Deep learning for Siri’s Voice: On-device deep mixture density networks for hybrid unit selection synthesis – August 23, 2017

Apple’s ‘Project Morgan’ revealed

Steve Wozniak to speak at ‘AI Deep Dive’ event on October 22 in Las Vegas

21 Comments

Jamboy55

Thursday, August 24, 2017 at 10:33 am

Really, click bait adverts on response page? No I would never vote for Michelle Obama. She’d be worse then her husband as the president of the USA.

Reply
1. deasystems
  
  Thursday, August 24, 2017 at 3:30 pm
  
  “She’d be worse then her husband as the president…”
  
  That statement is missing a comma so let me fix that for you: “She’d be worse, then her husband as the president…” Perhaps you meant, “She’d be worse than her husband as the president”?
  
  Reply
troy

Thursday, August 24, 2017 at 10:50 am

Wow, it’s frighthen how natural it sounds, sky net is closer every day, and the whole explanation its very intresting. Well google and samsung, there you have how they did it, you can start trying to make your cheap imitations for the intellectually poor people.

Reply
foretravelblog

Thursday, August 24, 2017 at 11:11 am

She sounds perkier.

Reply
1. JWSC
  
  Thursday, August 24, 2017 at 5:41 pm
  
  I’m a big fan of perky! 😉
  
  Reply
Ari

Thursday, August 24, 2017 at 11:15 am

I like the Siri voice in iOS 10 better then iOS 11. In 11 Siri is too his girl Friday. She is like an eager young intern. In iOS 10 the voice is more stable and emotionally secure. Less eager beaver.

Reply
1. crisrod63
  
  Thursday, August 24, 2017 at 2:06 pm
  
  I agree.
  
  Reply
2. KingMel
  
  Thursday, August 24, 2017 at 3:04 pm
  
  The OS11 voice is smoother and more natural, and I suspect that it will be easier to hear and understand in noisy situations. But I will miss the slight huskiness of the original Siri voice. So will Raj…
  
  Reply
  1. Xennex1170
    
    Thursday, August 24, 2017 at 7:10 pm
    
    Nice BBT reference. 🙂
    
    Reply
3. bbock
  
  Friday, August 25, 2017 at 6:00 pm
  
  Yes, I wish they’d hired the actress that provided the material for the previous versions, but with the updated intelligence code. It would also be easier to compare the three samples.
  
  Reply
Shock Me

Thursday, August 24, 2017 at 11:17 am

OF All the problems I’ve had with Siri, her voice was never one of them.

Reply
1. WriterGuy
  
  Thursday, August 24, 2017 at 12:12 pm
  
  Totally agree. The voice improvement is nice. But I’d rather have those scientists work on the way Siri understands questions. Let’s be honest — the Google voice assistant is much better at getting answers than Siri.
  
  Reply
  1. KingMel
    
    Thursday, August 24, 2017 at 3:08 pm
    
    I wouldn’t know about that…I do not use Google because I do not trust Google. Google’s products might be wonderful, but I value my privacy far more than I value the utility of Google’s products and services. Besides, I believe that Apple will pull ahead again sooner or later.
    
    Reply
  2. bbock
    
    Friday, August 25, 2017 at 6:02 pm
    
    I also wish Siri would actually read responses. Often it just pulls up a web page to read when the answer should be something it could read aloud. I also would like the option to have it read text messages aloud. You can do that in CarPlay, but the normal version is just dumb. They also need to teach it things like what’s on TV.
    
    Reply
2. ripabo
  
  Thursday, August 24, 2017 at 4:40 pm
  
  Yes, Siri needs some “deep learning” in her head. There was never any issue with her voice. I don’t care how good she sounds/looks…if she’d dumb, spending the time together is tough.
  
  Reply
  1. Xennex1170
    
    Thursday, August 24, 2017 at 7:09 pm
    
    Bit hard with the amount of data available to Siri compared with her competitors. Apple’s future dedicated AI chip may help but Google is already countering with an onboard lite version of their TensorFlow in ‘Oreo’ as well as opensourcing it to accelerate broader development. A single AI ‘OS’ for all sizes of platforms if you will.
    
    Reply
Josh

Thursday, August 24, 2017 at 11:37 am

The 11 voice is very bubbly and personal. Kind of like the bubbly gal who calls me several times a month trying to sell me time-share points. She drops her headphones but keeps on talking. I can’t shut her up even when I told her my son was arrested for dealing drugs to minors and the dog died.

Reply
Rettogo

Thursday, August 24, 2017 at 11:47 am

Solves one issue I’ve had for a while. Will be more natural using Siri. Now can I please use her to turn on and off my Hue lights without the request having to go to Apple first???

Reply
trondude

Thursday, August 24, 2017 at 3:48 pm

She sounds a bit nasal.

Reply
Derek Currie

Sunday, August 27, 2017 at 7:12 am

iOS 9 vs iOS 10 female Siri voice: WONDERFUL improvements.

A) Even more lifelike voice.
B) No longer sounds like Madonna (IOW it no longer gags me).

In historical futurism, ‘robot voices’ were monotonic. When the original Mac (1984) could talk better than monotonic, it was considered remarkable. Microsoft attempted something similar and came up with shite voices that sound like Dr. Stephen Hawking. (Sorry Doc). I was personally most pleased when the original Victoria voice was released. *love* But then Apple mangled it in an attempt to make her sound more animated. *fail*

The best third party voices, IMHO, are from Acapela Group. I particularly enjoy their Peter, Rosie and Queen Elizabeth voices. 👑

Reply
1. Derek Currie
  
  Sunday, August 27, 2017 at 7:13 am
  
  Correction:
  iOS 9 vs iOS 11 female Siri voice: Wonderful improvements.
  
  Reply