In this study, a novel and systematic prediction framework is proposed to optimize electrooxidation (EO) process parameters (anode material, applied current, reaction time, initial pH, type and concentration of supporting electrolyte, initial RIF concentration, and different real water matrices) during rifampin (RIF) degradation. This framework integrates machine learning (ML) algorithms (random forest, decision tree, gradient boosting, multilayer perceptron, support vector machine, and K-nearest neighbors) with various encoding strategies (one-hot, target, and label). Furthermore, hyperparameter optimization techniques (grid search, grid search with noise-augmented data, random search, and Bayesian optimization) were evaluated to enhance predictive performance and provide an in-depth examination of RIF degradation. The boron-doped diamond (BDD) anode achieved 100% RIF degradation under optimal conditions (i = 0.2 A, C RIF = 10 mg/L, SEc = 51 mM NaCl/L). In the performance evaluation of ML algorithms, the one-hot encoding for the DT algorithm yielded the best predictive accuracy, with the highest R 2 = 0.952 and Adjusted R 2 = 0.920 scores, alongside the lowest error metrics: MAE = 3.909, RMSE = 7.594, RSR = 0.217, AAD = 0.0624, and MAPE = 10.391. However, future importance analysis using Shapley Additive Explanations (SHAP), Gini impurity, and Mean Decrease in Accuracy (MDA) revealed that reaction time and anode materials are the most influential parameters for the EO process, whereas the electrolyte types and concentration demonstrated limited impact. Overall, the findings provide important insights into the EO process optimization and demonstrate the statistical robustness of the proposed ML-based framework. • The key parameters affecting RIF degradation during the EO process were comprehensively examined. • A new prediction framework was developed using ML algorithms and encoding strategies. • The framework bridged the gap between EO process optimization and statistical robustness. • ML performance was evaluated in detail using SHAP, Gini impurity, and MDA analyses. • Four fundamental hyperparameter optimization techniques were applied for an in-depth examination of RIF degradation.
Akhtar et al. (Sat,) studied this question.