As new structural forms emerge, concrete-filled steel tubular (CFST) columns have become critical load-bearing components in key infrastructure such as subways and underground utility tunnels. Accurately predicting their ultimate bearing capacity (Nu) is essential for structural safety. To address the limitations of traditional empirical formulas and code-based calculation approaches, this paper proposes a model for predicting ultimate bearing capacity based on the CatBoost algorithm, with hyperparameters tuned by Random Search, and uses interpretability analysis to measure the marginal contribution of each key feature to the predictions. First, a database of 438 CFST column ultimate bearing capacity tests was established, with key parameters such as geometric dimensions and material properties as input variables. Second, six machine learning algorithms were compared: CatBoost, LightGBM, Random Forest (RF), Gradient Boosting (GB), K-Nearest Neighbors (KNN), and XGBoost. Five-fold cross-validation combined with a Random Search strategy was used for joint hyperparameter optimization. The results show that the optimized CatBoost model significantly outperforms the other algorithms and conventional design codes, achieving a coefficient of determination (R²) of 0.99 and a root mean square error (RMSE) of 174.29 kN. In addition, the SHAP (SHapley Additive exPlanations) method was used to perform global and local interpretability analyses of the prediction model. This analysis quantified both the individual contribution and the interaction effects of each feature on the bearing capacity, and revealed that geometric parameters are the primary influencing factors. This finding indicates a high degree of consistency between the prediction mechanism of the data-driven model and classical mechanical theory, which supports the model’s reliability.
This study provides an efficient and reliable tool for the optimal design and rapid evaluation of CFST columns and establishes a new data-driven paradigm for the design and reinforcement of key components in underground structures.
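The tuning procedure summarized above (Random Search scored by five-fold cross-validation, evaluated with RMSE and R²) can be sketched in a few lines. The following is a minimal illustration only, not the authors' implementation: to stay dependency-free it swaps CatBoost for a plain k-nearest-neighbors regressor, tunes a single hyperparameter `k`, and runs on synthetic toy data rather than the 438-test database. All function names, the search range, and the data are assumptions made for illustration.

```python
import math
import random


def rmse(y_true, y_pred):
    """Root mean square error."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def knn_predict(X_train, y_train, x, k):
    """Stand-in model (assumption): mean target of the k nearest training points."""
    order = sorted(
        range(len(X_train)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(X_train[i], x)),
    )
    nearest = order[:k]
    return sum(y_train[i] for i in nearest) / len(nearest)


def k_fold_indices(n, n_folds, rng):
    """Shuffle the sample indices and deal them into n_folds disjoint folds."""
    idx = list(range(n))
    rng.shuffle(idx)
    return [idx[f::n_folds] for f in range(n_folds)]


def cv_scores(X, y, k, folds):
    """Mean RMSE and mean R² over the held-out folds for a given k."""
    rmses, r2s = [], []
    for test_idx in folds:
        held_out = set(test_idx)
        train_idx = [i for i in range(len(X)) if i not in held_out]
        X_tr = [X[i] for i in train_idx]
        y_tr = [y[i] for i in train_idx]
        y_te = [y[i] for i in test_idx]
        preds = [knn_predict(X_tr, y_tr, X[i], k) for i in test_idx]
        rmses.append(rmse(y_te, preds))
        r2s.append(r2_score(y_te, preds))
    return sum(rmses) / len(rmses), sum(r2s) / len(r2s)


def random_search(X, y, n_iter=10, n_folds=5, seed=0):
    """Sample k at random n_iter times; keep the k with the lowest CV RMSE."""
    rng = random.Random(seed)
    folds = k_fold_indices(len(X), n_folds, rng)
    best = {"k": None, "rmse": float("inf"), "r2": None}
    for _ in range(n_iter):
        k = rng.randint(1, 15)  # hypothetical search range
        mean_rmse, mean_r2 = cv_scores(X, y, k, folds)
        if mean_rmse < best["rmse"]:
            best = {"k": k, "rmse": mean_rmse, "r2": mean_r2}
    return best


# Synthetic toy data (NOT experimental data): two features, noisy linear target.
data_rng = random.Random(42)
X = [[data_rng.uniform(0, 1), data_rng.uniform(0, 1)] for _ in range(60)]
y = [3 * a + 2 * b + data_rng.gauss(0, 0.1) for a, b in X]

best = random_search(X, y)
print(best)
```

With a gradient-boosting library, the same loop would sample several hyperparameters per iteration (for example tree depth and learning rate) instead of a single `k`; the fold construction and scoring logic stay the same.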
Wang et al. (Mon,) studied this question.