Intel plans lip-reading boost for voice recognition


Intel has released software that lets a computer perform a task similar to human lip-reading, as an aid to existing sound-based voice-recognition systems.

The Audio Visual Speech Recognition (AVSR) software should improve the accuracy of speech-recognition software under difficult conditions, especially those involving background noise, Intel said.

The aim of AVSR is to enable computers to synchronise the video data captured on camera with the sound data to produce more accurate speech recognition.
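As a rough illustration of what synchronising the two streams can involve, the sketch below pairs each audio feature frame with the video frame on screen at the same moment, then joins the two feature vectors. It is not Intel's AVSR code: the function names, the frame rates and the toy feature values are all assumptions made for the example.

```python
def align_video_to_audio(num_audio_frames, audio_rate_hz, num_video_frames, video_fps):
    """For each audio frame, return the index of the video frame shown at that time.

    Hypothetical helper: rates are integers (frames per second), and both
    streams are assumed to start at the same instant.
    """
    indices = []
    for i in range(num_audio_frames):
        # Audio frame i starts at i / audio_rate_hz seconds; integer maths
        # avoids floating-point rounding when converting to a video index.
        j = (i * video_fps) // audio_rate_hz
        # Clamp in case the audio stream runs slightly longer than the video.
        indices.append(min(j, num_video_frames - 1))
    return indices


def fuse_features(audio_feats, video_feats, audio_rate_hz, video_fps):
    """Concatenate each audio feature vector with its time-aligned video vector."""
    idx = align_video_to_audio(len(audio_feats), audio_rate_hz,
                               len(video_feats), video_fps)
    return [a + video_feats[j] for a, j in zip(audio_feats, idx)]


# Toy data: 10 audio frames at 100 frames/s, 3 video frames at 30 frames/s.
audio = [[float(i)] for i in range(10)]    # stand-in for acoustic features
video = [[100.0], [200.0], [300.0]]        # stand-in for lip-shape features
fused = fuse_features(audio, video, audio_rate_hz=100, video_fps=30)
```

A recogniser could then be trained on the combined vectors, so that lip shape still carries information when the audio is drowned out by noise.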

AVSR is part of Intel's OpenCV computer vision library, a toolbox of imaging functions, which contains a number of face detection algorithms.

With the speed of today's microprocessors, falling camera prices and much greater video capture bandwidth from technologies like Universal Serial Bus 2, mainstream PCs are capable of running real-time computer vision algorithms, the company said.

OpenCV is an open-source code library that has been downloaded more than 500,000 times to date, Intel said.

Information about AVSR can be found at
