The growing need for sustainable and lightweight building materials has accelerated research on alternatives to conventional concretes, out of which Lightweight Expanded Clay Aggregate (LECA) concrete has emerged as a promising solution. However, the high porosity and nonlinear mechanical behavior of LECA concrete complicate the accurate prediction of compressive strength through conventional empirical models. The main focus of the paper is on identifying a comprehensive machine learning-based framework for modeling and predicting the 28-day compressive strength of LECA-based lightweight concrete. The dataset was created and preprocessed by using statistical normalization and correlation analysis. In this study, five supervised machine learning models—Multiple Linear Regression (MLR), Support Vector Regression (SVR), Random Forest (RF), eXtreme Gradient Boosting (XGBoost), and Categorical Boosting (CatBoost)—were developed and fine-tuned using a grid-search strategy combined with ten-fold cross-validation. The quality of the prediction made by each model was evaluated by means of standard performance indicators, such as the coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), and mean absolute percentage error (MAPE). After the evaluation, the models were subsequently compared and ranked according to the Gray Relational Analysis (GRA) method. The comparative assessment shows that CatBoost demonstrated the most reliable performance, achieving an R2 of 0.907, RMSE of 3.41 MPa, MAE of 2.47 MPa, and MAPE of 10.05%, outperforming the remaining algorithms. To interpret the significance of features, SHAP (Shapley Additive exPlanations) analysis was applied, which identified water and LECA content as the dominant factors influencing compressive strength, followed by the cement and fine aggregate proportions. The findings reveal that the ensemble-based gradient boosting model is capable of capturing intricate nonlinear interactions, as observed in the heterogeneous matrix of LECA concrete.
Nair et al. (Mon,) studied this question.