Home » How does Siri learn to speak Shanghainese? -INSIDE

How does Siri learn to speak Shanghainese? -INSIDE

by admin

Apple, Amazon, Microsoft, and Google all provide voice assistant services. Which one is better? According to a Reuters report, Apple’s voice assistant Siri may no longer have advantages in recognizing voices and answering questions, but one of Siri’s advantages is that it can speak the most languages. Now we are about to learn to speak Shanghainese. Let’s see how it does it. To. Before entering this article, think about how to say the following words in English:

Tailor-made (B) (speaking) vague (C) large-scale


The voice-assistant wars are (1) in full swing, with Apple, Amazon, Microsoft and now Google all offering electronic assistants to take your commands.

Apple, Amazon, Microsoft, and Google have all launched voice assistant services that accept human commands, and a voice assistant battle has begun.

Many researchers believe that Apple has squandered its lead when it comes to understanding speech and answering questions. However there is at least one thing Siri can do that the other assistants cannot: speak 21 languages localized for 36 countries, a very important capability in a smartphone market where most sales come from outside the United States.

Many researchers believe that Apple’s lead in speech recognition and answering questions has been exhausted, but there is one thing currently only Siri can do: speaking 21 languages ​​in 36 countries. This feature is extremely important in the smartphone market, because most smartphones are sold outside the United States.

Microsoft Cortana, by contrast, has eight languages (A) tailored for 13 countries. Google”s Assistant, which began in its Pixel phone but has since moved to other Android devices, speaks four languages. Amazon’s Alexa features only English and German. Siri will even soon start to learn Shanghainese, a special dialect of Wu Chinese spoken only around Shanghai. 


See also  Influenza: bookings are underway to vaccinate healthy children aged 6 months to 6 years

Microsoft Cortana has developed 8 languages ​​for 13 countries. Google Assistant speaks 4 languages. This service comes from Google’s own mobile phone Pixel, which is now available for other Android phones. Amazon’s Alexa only speaks English and German. And Siri is about to start learning Shanghai dialect, which is a Wu dialect spoken only in Shanghai and its surrounding areas.



At Apple, the company starts working on a new language by bringing in humans to read passages in a range of accents and dialects, which are then transcribed by hand so the computer has an exact representation of the spoken text to learn from, said Alex Acero, head of the speech team at Apple. Apple also captures a range of sounds in a variety of voices. From there, an acoustic model is built that tries to predict word sequences.

Alex Acero, head of Apple’s voice team, said that when new language functions are to be developed, real people with various dialects and accents will be asked to read paragraphs of text, and then manually transcribed, so that the computer can have accurate learning samples. Apple will also capture a variety of voices from different sounds, and then build an acoustic model to try to predict the sequence of characters.



Apple then deploys “dictation mode,” its text-to-speech translator, in the new language, Acero said. When customers use dictation mode, Apple captures a small percentage of the audio recordings and makes them
anonymous. The recordings, complete with background noise and (B) mumbled words, are transcribed by humans, a process that helps cut the speech recognition error rate in half. 


See also  Nothing Phone(2) confirms that it will launch this summer with Snapdragon 8+ and take the high-efficiency route

Acero said that Apple will then deploy “dictation mode” in the new language, a translator between text and speech. When the user uses the dictation mode, Apple will grab a small part of the audio recording and then anonymize it. These recordings contain background noise and ambiguous words. Transcribed by real people can reduce the error rate of speech recognition by half.



After enough data has been gathered and a voice actor has been recorded to play Siri in a new language, Siri is released with answers to what Apple expects will be the most common questions, Acero said. Once released, Siri learns more about what real-world users ask and is updated every two weeks with more tweaks. 


After collecting enough information and the voice actor to record the voice for Siri speaking in the new language, Siri can publish it. When it was released, Siri was able to answer the most common questions Apple expected. After the release, Siri can also learn from users’ actual problems, and make adjustments and updates every two weeks.



However, script-writing does not (C) stairs, said Charles Jolley, creator of an intelligent assistant named Ozlo. “You can’t hire enough writers to come up with the system you’d need in every language. You have to synthesize the answers,” he said.

However, Charles Jolley, the creator of the intelligent assistant Ozlo, said that writing scripts cannot be scaled. “It is impossible to hire enough authors to build the system required for each language. The answers must be artificially synthesized.”

See also  Privacy has bored (the elites): it is a mistake



The founders of Viv, a startup founded by Siri’s original creators that Samsung acquired last year, is working
on just that. “Viv was built to specifically address the scaling issue for intelligent assistants,” said Dag Kittlaus, the CEO and co-founder of Viv. “The only way to leapfrog today’s limited fuctionality versions is to open the system up and let the world teach them.” 


Viv, the start-up company of “Father of Siri”, is working to solve this problem. This company was acquired by Samsung last year. Dag Kittlaus, Viv’s co-founder and CEO, said: “Viv wants to solve the problem of scaling smart assistants. If you want to upgrade today’s limited-function versions, the only way is to open the system and let the world teach them.”

1. In full swing is in full swing;

By ten o’clock, the party was in full swing.

By ten o’clock, the party had reached a climax.

.

You may also like

Leave a Comment

This site uses Akismet to reduce spam. Learn how your comment data is processed.

This website uses cookies to improve your experience. We'll assume you're ok with this, but you can opt-out if you wish. Accept Read More

Privacy & Cookies Policy