Key points are not available for this paper at this time.
This research investigates the predictive performance of ensemble learning models, specifically Bagging, when combined with weak models including Polynomial Regression (PR), K-Nearest Neighbors (KNN), and Gamma Regression (GR) to estimate drug solubility in supercritical carbon dioxide as the solvent. The models were trained and optimized using the Bat Algorithm (BA). The objective was to accurately predict two important properties: CO 2 density and the solubility of phenytoin in it. The bagging technique was applied to combine the predictions of multiple weak models, enhancing overall performance. The results demonstrated remarkable predictive capabilities of the Bagging model with Polynomial Regression (BAG + PR) for both CO 2 density and drug solubility. It achieved a high R 2 score of 0.9949 for CO 2 density and 0.97833 for solubility. The BAG + PR model also exhibited the lowest Root Mean Square Error (RMSE), indicating superior accuracy in predictions. Moreover, it exhibited the lowest Average Absolute Relative Deviation (AARD%) and Maximum Error, further validating its effectiveness in accurately capturing the relationships among the variables. Comparatively, the BAG + KNN and BAG + GR models also performed well but fell short of the BAG + PR model. While they showed respectable R 2 scores, their RMSE values were higher, suggesting larger prediction errors. The AARD% and Maximum Error metrics were also higher for these models, indicating less precise and more variable predictions.
Yu et al. (Fri,) studied this question.