What question did this study set out to answer?

This study investigates the use of machine learning to estimate Total Volatile Fatty Acids (TVFA) in anaerobic digestion bioreactors.

March 3, 2026Open Access

Explainable Machine Learning for Volatile Fatty Acid Soft-Sensing in Anaerobic Digestion: A Pilot Feasibility Study

Puntos clave

This study investigates the use of machine learning to estimate Total Volatile Fatty Acids (TVFA) in anaerobic digestion bioreactors.
Analyzed data from controlled CO2 biomethanisation experiments.
Benchmarking of multiple regression models including TabNet, ANNs, XGBoost, and LightGBM.
Model evaluation using cross-validated performance metrics.
TabNet model achieved R2 of 0.8551, indicating good predictive power.
RMSE of 0.0090 and MAE of 0.0067 demonstrated model accuracy.
pCO2 identified as the primary factor influencing TVFA predictions.

Resumen

Sustainable energy systems such as anaerobic digestion (AD) bioreactors exhibit complex nonlinear dynamics that complicate the monitoring of key stability indicators using conventional laboratory-based methods. As a preliminary investigation, this pilot study explores the feasibility of using machine learning-based soft sensing to estimate Total Volatile Fatty Acids (TVFA(M)) from routinely measured physicochemical parameters. Using a short-term laboratory dataset obtained from controlled CO2 biomethanisation experiments, several regression models were benchmarked, including an attention-based deep learning architecture (TabNet), multi-architecture artificial neural networks (ANNs), gradient-boosting ensembles (CatBoost, XGBoost, LightGBM), and classical kernel-based approaches. Model performance was evaluated under a cross-validated framework to assess predictive capability and consistency across folds within the limited experimental scope. Among the tested models, TabNet achieved highly competitive performance, yielding an R2 of 0.8551, an RMSE of 0.0090, and an MAE of 0.0067. To support model transparency and interpretability, Explainable Artificial Intelligence (XAI) techniques based on SHapley Additive exPlanations (SHAP) were applied, identifying pCO2 as the dominant contributor to TVFA(M) predictions within the studied operational range. The results demonstrate the potential of explainable machine learning models as soft sensors for TVFA(M) estimation under controlled laboratory conditions. Although restricted to controlled laboratory conditions and a short observation period, this pilot study demonstrates the potential of explainable machine learning models for TVFA(M) estimation and provides a methodological benchmark for future validation using larger and more diverse datasets.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo

Cite This Study

Amangeldy et al. (Sun,) studied this question.

synapsesocial.com/papers/69a67ee0f353c071a6f0a71f https://doi.org/https://doi.org/10.3390/a19030183

Me gusta

Guardar

Ver artículo completo