What does this research mean for the field?

Inaudible ultrasonic signals can exploit microphone nonlinearity in modern hearables to generate phantom audible sounds that reliably deceive human users into following false instructions or perceiving fake environmental sounds. Novelty: ClaimNovelty.NOVEL_FINDING. Consensus alignment: ConsensusAlignment.ESTABLISHES_NEW_DIRECTION.

What question did this study set out to answer?

This research aims to investigate the security vulnerabilities of hearable devices against ultrasonic attacks that generate phantom sounds.

March 18, 2026

UltrasonicWhisper+: Ultrasonic Attacks Generate Phantom Sounds in Your Hearable

Key Points

This research aims to investigate the security vulnerabilities of hearable devices against ultrasonic attacks that generate phantom sounds.
Developed UltrasonicWhisper+ to exploit microphone nonlinearity
Evaluated five commercial hearables for sound quality using mean opinion scores
Conducted behavioral studies to assess user response to false instructions
Measured spatial localization accuracy of ultrasonic sounds
Mean opinion scores for sound quality ranged from 1.26 to 3.27 across devices
Short-time objective intelligibility scores varied from 0.44 to 0.75
Participants followed 28.0% of false instructions despite warnings
Spatialized ultrasonic sounds achieved 65.5% localization accuracy
False acceptance rate of 28.4% indicates high risk of deception

Abstract

Modern hearables, such as wireless earbuds with transparency mode, are designed to enhance user awareness by relaying ambient sounds. However, this functionality introduces a new attack surface. We present UltrasonicWhisper+, a novel attack that exploits microphone nonlinearity to inject inaudible ultrasound into hearables, resulting in the demodulation of phantom audible sounds delivered directly to the user. Unlike prior ultrasound-based attacks that target voice assistants, our method deceives users themselves by simulating either internal hearable audio or spatial environmental sounds. We evaluate five commercial hearables and find that demodulated sound quality varies by device, with mean opinion scores (MOS) ranging from 1.26 to 3.27 and short-time objective intelligibility (STOI) ranging from 0.44 to 0.75. Behavioral studies show that participants followed an average of 28.0% of false instructions even after being warned about the attack. Moreover, spatialized ultrasonic sounds achieved 65.5% localization accuracy (±45°), and a false acceptance rate of 28.4% when perceived as ambient sound. These findings demonstrate that users can be reliably deceived via inaudible audio signals, raising serious concerns for safety-critical applications. Our results call for a reexamination of hearable device security and highlight the need for robust countermeasures.

Bookmark

Cite This Study

Watanabe et al. (Mon,) studied this question.

synapsesocial.com/papers/69ba41e04e9516ffd37a1c12 https://doi.org/https://doi.org/10.1145/3789679

Bookmark