What question did this study set out to answer?

The aim is to enhance predictive accuracy in regression models for material chemistry using SHAP values.

March 19, 2026Open Access

Leveraging SHAP values for superior prediction and efficient Bayesian optimization in material chemistry

Key Points

The aim is to enhance predictive accuracy in regression models for material chemistry using SHAP values.
Implemented feature extraction method using SHAP values for regression analysis.
Compared performance of various base models for SHAP-based feature extraction, focusing on random forest.
Investigated the efficiency of material exploration during Bayesian optimization.
SHAP values improved predictive accuracy for underfitting regression models.
Random forest outperformed other models in capturing complex non-linear relationships.
The proposed method enhanced the efficiency of material exploration in Bayesian optimization.

Abstract

Abstract In recent years, machine learning has played a crucial role in data-driven material development. This study presents a feature extraction method for enhancing the predictive accuracy of regression models. The proposed method is examined using SHapley Additive exPlanations (SHAP) values, which are commonly used for interpreting black-box models, to determine whether it can transfer the high expressiveness of an accurate regression model to underfitting regression models. It was revealed that SHAP values can capture valuable information for regression analysis, resulting in improved predictive accuracy. The results also underscore the importance of base model selection to extract SHAP values, whose effectiveness is significantly influenced by the base model. Random forest demonstrated superior performance for SHAP-based feature extraction, presumably because of its ability to capture complex non-linear relationships, regardless of the specific SHAP explainer used. In addition, the proposed method can improve material exploration efficiency during Bayesian optimization. Graphical abstract

Bookmark

View Full Paper

Bookmark

View Full Paper

Leveraging SHAP values for superior prediction and efficient Bayesian optimization in material chemistry

Key Points

Abstract

Cite This Study