What question did this study set out to answer?

The research aims to investigate and improve the impact of ear canal deformation on the quality of in-ear speech for better speech enhancement performance.

June 17, 2026Open Access

Exploring and Addressing Low-Quality Auxiliary Modality in Earable Dual-microphone Speech Enhancement

Key Points

The research aims to investigate and improve the impact of ear canal deformation on the quality of in-ear speech for better speech enhancement performance.
Developed a quality-aware speech enhancement solution named QuaSE.
Analyzed the effects of ear canal deformation on in-ear speech quality.
Implemented a training strategy utilizing quality-aware data selection and content-aware augmentation.
QuaSE improves PESQ by 6.27%, STOI by 4.54%, SI-SDR by 14.90%, and SegSNR by 11.93% compared to existing methods.
Demonstrated that the quality-aware fusion strategy enhances performance in other sensing tasks.

Abstract

To enhance the speech clarity in earable voice interaction scenarios, dual-microphone speech enhancement (SE) techniques with collaboration of in-ear and out-ear microphones have garnered significant attention from the research community. Nevertheless, existing dual-microphone SE techniques are established on a strong assumption: high-quality in-ear speech (auxiliary modality) could provide efficient complementary information to target airborne speech (primary modality) , which decreases the adaptation in the real world. In our work, we explore a key observation that air pressure imbalance caused by ear canal deformation (ECD) adversely affects the quality of in-ear speech, subsequently leading to a significant degradation in speech enhancement performance. To address this bottleneck issue, we design an efficient quality-aware speech enhancement solution, named QuaSE, which efficiently and dynamically fuses complementary information by assessing the quality variations of in-ear speech. Additionally, based on the analysis of spectral distortion induced by ECD, a training strategy including quality-aware data selection and content-aware augmentation is designed to improve the generalization capability of QuaSE. Extensive experiments demonstrate that QuaSE outperforms state-of-the-art techniques by 6.27%, 4.54%, 14.90%, and 11.93% in terms of PESQ, STOI, SI-SDR, and SegSNR. Moreover, we also validate that the proposed quality-aware fusion strategy can be modularly integrated into other sensing tasks, improving the fusion performance.

Read Full Paperexternally

Ask AI

Mark Helpful

Bookmark

Relay

View Full Paper