What question did this study set out to answer?

The aim is to develop a real-time speech processing system that enhances listening comfort for users of hearing aids in noisy environments.

May 14, 2026

Real-time hearing assistance system combining multichannel speech enhancement and voice conversion for improved listening comfort

Key Points

The aim is to develop a real-time speech processing system that enhances listening comfort for users of hearing aids in noisy environments.
Developed a lightweight pipeline combining speech enhancement and voice conversion.
Utilized a distributed device with multi-sensor inputs including microphone arrays and smartphone mics.
Conducted evaluation experiments simulating hearing aid use to assess performance and user comfort.
Confirmed real-time operability of the system during simulations.
Reported improvements in subjective listening comfort, specifically perceived vocal softness.
Examined the impact of front-end speech enhancement parameters on voice conversion performance.

Abstract

We propose a real-time speech processing system for hearing assistance applications, combining speech enhancement (SE) and voice conversion (VC) in a lightweight pipeline. The system utilizes a low-latency, deep-learning-free SE module to extract the target speaker’s voice from a multichannel signal observed in noisy, multi-speaker environments, followed by a VC module that transforms the extracted voice to improve intelligibility and listening comfort. The system is designed to work with a distributed assistive device equipped with multi-sensor input, including both-ear microphone arrays and additional microphones in a smartphone at hand. Motivated by the need to improve the intelligibility of a specific speaker in hearing aid scenarios, the system focuses on extracting and converting the target voice in challenging auditory scenes. Evaluation experiments simulating hearing aid use confirmed its real-time operability and revealed improvements in subjective listening comfort, including perceived vocal softness. Furthermore, we investigated how the parameter settings of the front-end SE affect the downstream VC performance and discussed its optimal configuration for the overall pipeline.

Ask AI

Mark Helpful

Bookmark

Relay