What question did this study set out to answer?

The aim is to evaluate machine learning methods for estimating daily evaporation in small reservoirs in semi-arid regions.

April 12, 2026Open Access

Applications of machine learning in enhancing evaporation estimation for small reservoirs: a case study in semi-arid South Texas

Key Points

The aim is to evaluate machine learning methods for estimating daily evaporation in small reservoirs in semi-arid regions.
Developed a machine learning framework for estimating daily evaporation in small reservoirs.
Compared empirical models (Penman, Penman-Monteith, etc.) with ML models (RF, SVR, etc.)
Used meteorological data and reservoir attributes for training the models.
Validated findings against observational data from two sources (DLEM and TexasETNet).
Machine learning models (RF and SVR) showed better performance than empirical models with R2 values of 0.67.
RF had the closest alignment with DLEM observations and lower RMSE compared to other models.
Mean annual evaporation estimates from ML (1620–1698 mm yr⁻¹) exceeded those predicted by TexasETNet (1452 mm yr⁻¹).

Abstract

Small reservoirs in semi-arid regions experience substantial evaporative losses but are rarely monitored at daily scales. A multi-reservoir machine learning (ML) framework was developed to estimate daily open-water evaporation. Empirical models (Penman, Penman-Monteith, Priestley-Taylor, Bowen Ratio Energy Budget) andabenchmark combination method (Daily Lake Evaporation Model-DLEM) were compared against ML models. Predictors combined gridded meteorology (gridMET) with reservoir attributes (surface area, average depth, maximum depth, and fetch). ML models (Random Forest-RF, Decision Tree-DT, K-Nearest Neighbor-KNN, and Support Vector Regression-SVR) were trained on four reservoirs using data from 2018 to 2025. Results from ML models were further validated using both DLEM and TexasETNet observations at Delta Lake. On multi-reservoir tests, RF and SVR perform best (R2 = 0.67; RMSE ~ 1.5 mm d− 1; NSE = 0.67). RF aligns most closely with DLEM (R2 = 0.78; RMSE = 1.22 mm d− 1), whereas SVR aligns better with TexasETNet (R2 = 0.33) and shows lowest bias among ML models. Mean annual evaporation from ML and DLEM is 1620–1698 mm yr⁻¹, which exceeds predictions from TexasETNet (1452 mm yr⁻¹) and exhibits narrower interannual variability (± 300–350 mm yr ¹ vs. ±700 mm yr⁻¹). Feature importance and SHAP analyses identified shortwave radiation, temperature (Tmax/Tmin), and ET0 as dominant drivers. The proposed transferable ML framework delivers accurate daily estimates where observations are sparse, while dual validation clarifies differences between process-based and station datasets, supporting reservoir operations and evaporation mitigation planning in under monitored, semi-arid regions.

Bookmark

View Full Paper

Bookmark

View Full Paper

Applications of machine learning in enhancing evaporation estimation for small reservoirs: a case study in semi-arid South Texas

Key Points

Abstract

Cite This Study