What question did this study set out to answer?

The study set out to develop and validate an interpretable machine learning model for predicting high-frequency hearing loss (HFHL) risk among noise-exposed workers.

January 17, 2026 · Open Access

An interpretable machine learning approach for predicting high-frequency hearing loss risk in occupational workers

Key Points

The study aimed to develop and validate a machine learning model for predicting high-frequency hearing loss risk among noise-exposed workers.
The authors retrospectively analyzed occupational health records from 5,037 noise-exposed workers.
Predictors included demographic data, occupational exposure history, and laboratory variables.
Multiple machine learning models were compared, and CatBoost was selected for its superior performance.
The CatBoost model achieved an AUC of 0.76, indicating good predictive performance.
Its sensitivity was 0.71 and its specificity was 0.68.
Age, noise exposure, and red blood cell count were identified as key predictors.
SHAP analysis produced individualized risk profiles for personalized assessment.
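
To make the reported metrics concrete, the following minimal sketch shows how AUC, sensitivity, and specificity can be computed from predicted risk scores. The labels, scores, and the 0.5 threshold are made-up toy values for illustration; they are not data from the study, and the study's own evaluation code is not described in this summary.

```python
# Toy illustration of the metrics named above (AUC, sensitivity, specificity).
# All values below are hypothetical; none come from the study.

def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels and predictions."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return tp / (tp + fn), tn / (tn + fp)

def auc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))

# Hypothetical labels (1 = hearing loss) and model risk scores.
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.6, 0.4, 0.7, 0.3, 0.2]

# Hypothetical decision threshold of 0.5 turns scores into class predictions.
y_pred = [1 if s >= 0.5 else 0 for s in scores]

sens, spec = sensitivity_specificity(y_true, y_pred)
toy_auc = auc(y_true, scores)  # 7 of the 9 positive/negative pairs are ranked correctly
print(f"sensitivity={sens:.2f} specificity={spec:.2f} AUC={toy_auc:.2f}")
```

AUC does not depend on the threshold, whereas sensitivity and specificity do, so moving the threshold trades one against the other.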

Abstract

Background: High-frequency hearing loss (HFHL) is prevalent among noise-exposed workers, yet routine screening remains costly. This study develops and validates an interpretable machine learning model for predicting HFHL risk, aiming to provide a cost-effective tool for early detection and targeted intervention.

Methods: A retrospective analysis was conducted on occupational health records from 5,037 workers exposed to noise. Demographic data, occupational exposure history, and laboratory variables were analyzed. Multiple machine learning models were compared, with CatBoost selected due to its superior performance.

Results: The CatBoost model achieved an AUC of 0.76, with sensitivity of 0.71 and specificity of 0.68. Age, noise exposure, and red blood cell count were the most influential predictors. SHAP analysis provided individualized risk profiles, facilitating personalized risk assessment.

Conclusions: This interpretable machine learning model offers robust accuracy in predicting HFHL risk, supporting cost-effective screening and personalized occupational health strategies.
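
SHAP attributes each individual prediction to its input features using Shapley values. As a rough illustration of that idea, the sketch below computes exact Shapley values by brute-force enumeration for a hypothetical three-feature risk function. The function `toy_risk`, its coefficients, and the worker and baseline values are all invented for this example; this is not the study's model, and it is not how the SHAP library computes attributions for tree models such as CatBoost.

```python
from itertools import combinations
from math import factorial

def toy_risk(z):
    """Hypothetical risk score; the coefficients are invented, not from the study."""
    age, noise_db, rbc = z
    return 0.02 * age + 0.01 * noise_db - 0.05 * rbc + 0.0001 * age * noise_db

def shapley_values(predict, x, baseline):
    """Exact Shapley values of predict at x, relative to a baseline input.

    Features outside a coalition are replaced by their baseline value.
    The cost grows exponentially with the number of features, so this
    only suits tiny examples.
    """
    n = len(x)

    def value(subset):
        z = [x[i] if i in subset else baseline[i] for i in range(n)]
        return predict(z)

    phi = []
    for i in range(n):
        others = [j for j in range(n) if j != i]
        total = 0.0
        for k in range(len(others) + 1):
            # Weight for coalitions of size k: k! (n - k - 1)! / n!
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for s in combinations(others, k):
                coalition = set(s)
                total += weight * (value(coalition | {i}) - value(coalition))
        phi.append(total)
    return phi

# Hypothetical worker and reference ("average") profile: age, noise level, red blood cell count.
worker = [50.0, 90.0, 4.5]
baseline = [40.0, 80.0, 4.8]

phi = shapley_values(toy_risk, worker, baseline)
for name, contribution in zip(["age", "noise", "rbc"], phi):
    print(f"{name}: {contribution:+.4f}")
```

The contributions sum to the difference between the worker's score and the baseline score. This additivity is what lets a per-feature breakdown be read as an individualized risk profile.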
