Abstract Machine learning models often struggle when confronted with the complexities of real-world data, such as noise and incomplete information, which pose significant challenges for practical applications. This paper presents a systematic framework for assessing the robustness of machine learning models against these adversities. In particular, we investigate how noise affecting the most critical attribute of a dataset impacts on model predictions. Our methodology uses a variety of datasets, a wide range of machine learning models, and a customizable noise induction function, allowing for comprehensive evaluations tailored to specific application needs. Through an extensive experimental analysis, we identify the vulnerabilities of models when their critical attributes are perturbed. In addition, by applying hierarchical clustering, we categorise machine learning families based on their robustness, establishing a taxonomy that highlights the susceptibility of different models to noise. Our findings provide important insights into the interplay between model performance and noise levels and improve the understanding of model behaviour under adverse conditions.
Padró-Ferragut et al. (Mon,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: