What question did this study set out to answer?

The aim is to enhance the detection of anomalies in operational sounds by improving spectral details and classification approaches.

February 5, 2026Open Access

Anomalous Sound Detection by Fusing Spectral Enhancement and Frequency-Gated Attention

Puntos clave

The aim is to enhance the detection of anomalies in operational sounds by improving spectral details and classification approaches.
Developed FGASpecNet model for anomaly detection.
Employed a spectral enhancement branch for better spectral detail.
Implemented frequency-gated attention to focus on key frequency bands.
Utilized a joint loss design combining classification and metric learning loss.
Exploited multi-center prototypes for improved intra-class and inter-class structures.
Achieved 95.04% average AUC in detection performance.
Recorded 89.68% pAUC, indicating strong anomalous sound detection.
Demonstrated successful performance on DCASE 2020 Challenge Task 2.

Resumen

Unsupervised anomalous sound detection aims to learn acoustic features solely from the operational sounds of normal equipment and identify potential anomalies based on these features. Recent self-supervised classification frameworks based on machine ID metadata have achieved promising results, but they still face two challenges in industrial acoustic scenarios: Log-Mel spectrograms tend to weaken high-frequency details, leading to insufficient spectral characterization, and when normal sounds from different machine IDs are highly similar, classification constraints alone struggle to form clear intra-class structures and inter-class boundaries, resulting in false positives. To address these issues, this paper proposes FGASpecNet, an anomaly detection model integrating spectral enhancement and frequency-gated attention. For feature modeling, a spectral enhancement branch is designed to explicitly supplement spectral details, while a frequency-gated attention mechanism highlights key frequency bands and temporal intervals conditioned on temporal context. Regarding loss design, a joint training strategy combining classification loss and metric learning loss is adopted. Multi-center prototypes enhance intra-class compactness and inter-class separability, improving detection performance in scenarios with similar machine IDs. Experimental results on the DCASE 2020 Challenge Task 2 for anomalous sound detection demonstrate that FGASpecNet achieves 95.04% average AUC and 89.68% pAUC, validating the effectiveness of the proposed approach.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo