Key points are not available for this paper at this time.
In this work, we address the long-standing problem of automatic recorded music denoising. In previous audio denoising research, the primary focus has been on speech, and music denoising works only considered noise types in indoor conversation scenarios or old gramophone recordings, neglecting the amateur music recording scenario. To this end, we first propose MusicECAN, an automatic music denoising method designed to filter out additional noise components in recorded music. The novel architecture comprises two key components, namely, a feature learning module and a noise filtering module, which can efficiently but effectively model, refine and denoise the noisy input. Specifically, in order to capture sufficient noisy music information, an ECA-U-SAM based feature learning module is designed by incorporating an efficient channel attention (ECA) mechanism into the traditional U-Net model with a supervised attention module (SAM). To train our MusicECAN, we collect M&N, a dataset containing various clean music and noise recordings. Through the combination of different clean-noise recording pairs, we can effectively simulate possible music performance environments with various background noise. Extensive quantitative and qualitative comparisons demonstrate that our MusicECAN outperforms the state-of-the-art audio denoising methods.
Building similarity graph...
Analyzing shared references across papers
Loading...
Haonan Cheng
Harbin University of Science and Technology
Shulin Liu
Communication University of China
Zhicheng Lian
Beijing Normal University
IEEE/ACM Transactions on Audio Speech and Language Processing
Tianjin University
Communication University of China
Building similarity graph...
Analyzing shared references across papers
Loading...
Cheng et al. (Mon,) studied this question.
synapsesocial.com/papers/6a1e57c360864841a668c80f — DOI: https://doi.org/10.1109/taslp.2024.3378118
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: