About Careers MedBlog Contact us
Medindia LOGIN REGISTER
Advertisement

World's First Human-Like Speech Recognition System by Microsoft

by Dr. Trupti Shirole on October 20, 2016 at 5:58 AM
Font : A-A+

 World's First Human-Like Speech Recognition System by Microsoft

A technology that accurately recognizes the words in a conversation like people do has been created by Microsoft researchers. This major breakthrough in the field of speech recognition may soon help people suffering from speech-related issues.

The team from Microsoft Artificial Intelligence and Research reported a speech recognition system that makes the same or fewer errors than professional transcriptionists.

Advertisement


The researchers reported a word error rate (WER) of 5.9%, down from the 6.3% WER the team reported in September 2016.

The 5.9% error rate is about equal to that of people who were asked to transcribe the same conversation, and it's the lowest ever recorded against the industry standard "Switchboard" speech recognition task.
Advertisement

"We've reached human parity. This is an historic achievement," said Xuedong Huang, the company's chief speech scientist in a Microsoft blog post.

The milestone means that, for the first time, a computer can recognize the words in a conversation as well as a person would.

In doing so, the team has beat a goal they set less than a year ago - and greatly exceeded everyone else's expectations as well.

"Even five years ago, I wouldn't have thought we could have achieved this. I just wouldn't have thought it would be possible," said Harry Shum, executive vice president who heads the Microsoft Artificial Intelligence and Research group.

The research milestone comes after decades of research in speech recognition, beginning in the early 1970s with DARPA, the US agency tasked with making technology breakthroughs in the interest of national security.

"This accomplishment is the culmination of over 20 years of effort," said Geoffrey Zweig, who manages the Speech & Dialog research group.

The milestone will have broad implications for consumer and business products that can be significantly augmented by speech recognition. That includes consumer entertainment devices like the Xbox, accessibility tools such as instant speech-to-text transcription and personal digital assistants such as Cortana.

"This will make Cortana (Microsoft personal assistant) more powerful, making a truly intelligent assistant possible," Shum said.

To reach the human parity milestone, the team used Microsoft's Computational Network Toolkit (CNTK), a home-grown system for deep learning that the research team has made available on GitHub via an open source license.

CNTK's ability to quickly process deep learning algorithms across multiple computers running a specialized chip called a graphics processing unit vastly improved the speed at which the team was able to do research and, ultimately, reach human parity.

Moving forward, the researchers are working on ways to make sure that speech recognition works well in more real life settings.

That includes places where there is a lot of background noise, such as at a party or while driving on the highway.

In the longer term, researchers will focus on ways to teach computers not just to transcribe the acoustic signals that come out of people's mouths, but instead to understand the words they are saying.

"The next frontier is to move from recognition to understanding," Zweig said.

Source: IANS
Advertisement

Advertisement
News A-Z
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
What's New on Medindia
Get Involved and Stand Up for Human Rights on Human Rights Day 2022
Coronary Artery Bypass Grafting
Macronutrients Calculator for Weight Loss
View all
Recommended Reading
News Archive
Date
Category
Advertisement
News Category

Medindia Newsletters Subscribe to our Free Newsletters!
Terms & Conditions and Privacy Policy.

More News on:
Language Areas in The Brain 

Most Popular on Medindia

Accident and Trauma Care Blood Pressure Calculator Color Blindness Calculator Find a Doctor Pregnancy Confirmation Calculator Drug - Food Interactions A-Z Drug Brands in India Find a Hospital Selfie Addiction Calculator Sanatogen
This site uses cookies to deliver our services.By using our site, you acknowledge that you have read and understand our Cookie Policy, Privacy Policy, and our Terms of Use  Ok, Got it. Close
×

World's First Human-Like Speech Recognition System by Microsoft Personalised Printable Document (PDF)

Please complete this form and we'll send you a personalised information that is requested

You may use this for your own reference or forward it to your friends.

Please use the information prudently. If you are not a medical doctor please remember to consult your healthcare provider as this information is not a substitute for professional advice.

Name *

Email Address *

Country *

Areas of Interests