December 5, 2025Open Access

Development of robust machine learning models to estimate hydrochar higher heating value and yield based upon biomass proximate analysis

Key Points

Hydrochar yield and HHV prediction achieved significant accuracy using machine learning algorithms.
The best model, CatBoost, achieved R² of 0.98 for HHV and 0.94 for yield estimation.
Evaluation of data involved Monte Carlo Outlier Detection and a curated dataset of 481 samples.
Temperature and ash content were pivotal factors influencing HHV and yield predictions.

Abstract

Abstract This study introduces a robust machine learning framework for predicting hydrochar yield and higher heating value (HHV) using biomass proximate analysis. A curated dataset of 481 samples was assembled, featuring input variables such as fixed carbon, volatile matter, ash content, reaction time, temperature, and water content. Hydrochar yield and HHV served as the target outputs. To enhance data quality, Monte Carlo Outlier Detection (MCOD) was employed to eliminate anomalous entries. Thirteen machine learning algorithms, including convolutional neural networks (CNN), linear regression, decision trees, and advanced ensemble methods (CatBoost, LightGBM, XGBoost) were systematically compared. CatBoost demonstrated superior performance, achieving an R 2 of 0.98 and mean squared error (MSE) of 0.05 for HHV prediction, and an R 2 of 0.94 with MSE of 0.03 for yield estimation. SHAP analysis identified ash content as the most influential feature for HHV prediction, while temperature, water content, and fixed carbon were key drivers of yield. These results validate the effectiveness of gradient boosting models, particularly CatBoost, in accurately modeling hydrothermal carbonization outcomes and supporting data-driven biomass valorization strategies. Graphical abstract

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Guoliang Hou

Ahmad Alkhayyat

Ahmad Almalkawi

Journals

Bioresources and Bioprocessing

Actions

Institutions

Saveetha University

Chitkara University

Jain University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Development of robust machine learning models to estimate hydrochar higher heating value and yield based upon biomass proximate analysis

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study