Cell Phone Vibrations Reveal Remote Conversations

Pennsylvania State University

An emerging form of surveillance, "wireless-tapping," explores the possibility of remotely deciphering conversations from the tiny vibrations produced by a cell phone's earpiece. With the goal of protecting users' privacy from potential bad actors, a team of computer science researchers at Penn State demonstrated that transcriptions of phone calls can be generated from radar measurements taken up to three meters, or about 10 feet, from a phone. While accuracy remains limited - around 60% for a vocabulary of up to 10,000 - the findings raise important questions about future privacy risks.

They published their research in Proceedings of WiSec 2025: 18th ACM Conference on Security and Privacy in Wireless and Mobile Networks. The work builds upon a 2022 project in which the team used a radar sensor and voice recognition software to wirelessly identify 10 predefined words, letters and numbers with up to 83% accuracy.

"When we talk on a cell phone, we tend to ignore the vibrations that come through the earpiece and cause the whole phone to vibrate," said first author Suryoday Basak, doctoral candidate in computer science. "If we capture these same vibrations using remote radars and bring in machine learning to help us learn what is being said, using context clues, we can determine whole conversations. By understanding what is possible, we can help the public be aware of the potential risks."

Basak and his adviser, Mahanth Gowda, associate professor of computer science and engineering, who co-authored the paper, used a millimeter-wave radar sensor - the same type of technology used in self-driving cars, motion detectors and 5G wireless networks - to explore the potential for compact, radar-based devices that could be miniaturized to fit inside everyday objects like pens. Their experimental setup is only for research purposes, the researchers said, developed in anticipation of what bad actors could potentially create. They then adapted "Whisper," an open-source, large-scale speech recognition model powered by artificial intelligence (AI), to decode the vibrations into recognizable speech transcriptions.

"Over the last three years, there's been a huge explosion in AI capabilities and open-source speech recognition models," Basak said. "We can use these models, but they are catered more toward clean speech or everyday use cases, so we have to adapt them to recognize low quality, 'noisy' radar data."

/Public Release. This material from the originating organization/author(s) might be of the point-in-time nature, and edited for clarity, style and length. Mirage.News does not take institutional positions or sides, and all views, positions, and conclusions expressed herein are solely those of the author(s).View in full here.