Scientists develop artificial intelligence sonar glasses: can recognize lips, with an accuracy of 95% 04/26 Update SLTechnology News&Howtos

Scientists develop artificial intelligence sonar glasses: can recognize lips, with an accuracy of 95%

2025-04-26 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > IT Information >

Shulou(Shulou.com)11/24 Report--

CTOnews.com, April 10 (Xinhua)-- researchers at Cornell University in the United States have developed a new technology that allows silent communication through sonar glasses. The glasses use miniature speakers and microphones to read the words the wearer says silently, allowing them to perform a variety of tasks without physical input.

The technology, led by Zhang Ruidong, a doctoral student at Cornell University, is an improvement on a similar project that uses a wireless headset while the previous model relies on a camera.

According to CTOnews.com, the sonar glasses use a silent speech recognition interface called EchoSpeech, which uses sonar to sense mouth movement and uses a deep learning algorithm to analyze echo features in real time. This enables the system to identify the words the wearer says silently with about 95% accuracy.

One of the most exciting prospects of this technology is that for people with language disorders, it can be used to silently enter a conversation into a speech synthesizer and then say the words out loud. Glasses can also be used to control music playback in quiet libraries or dictate messages at noisy concerts.

The technology is compact, low-power and does not invade privacy because there is no data leaving the user's phone. In this way, there are no privacy concerns. Glasses are very convenient to wear and are more practical and feasible than other available silent speech recognition technologies.

The researchers said that the system only needs a few minutes of training data to learn the user's voice patterns, and after learning, it can send and receive sound waves to the user's face and sense the movement of the mouth. at the same time, deep learning algorithm is used to analyze echo features. The system is currently able to recognize 31 isolated commands and a string of consecutive numbers, with an error rate of less than 10%.

The current version of the system provides about 10 hours of battery life and can communicate wirelessly with users' smartphones via Bluetooth. The smartphone processes and predicts all data and transmits the results to some "action keys" so that it can play music, interact with smart devices, or activate voice assistants.

Cornell University's Intelligent computer Interface Future interaction (SciFi) Laboratory is using a Cornell University funding program to explore the possibility of commercializing the technology.

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.