May 20, 2026 | Open Access

Do Deep‐Learning Models Perform Better Than Process‐Based Models? A Diagnostic Evaluation Using Synthetic Streamflow Simulation Under Various Sources of Uncertainties

Key Points

This study aims to evaluate how deep learning and process-based models perform under various data uncertainties affecting streamflow estimation.
Evaluated the SAC process-based model, LSTM deep learning model, and hybrid model under controlled data uncertainty scenarios.
Considered streamflow measurement error, precipitation input error, and out-of-sample conditions in the analysis.
Used synthetic streamflow data generated from a PBM as the baseline for comparison.
DLMs showed higher sensitivity to measurement errors than PBMs, suggesting PBMs are better for low-quality data.
In both gauged and ungauged basins, DLMs were less sensitive to input errors than PBMs.
All models were more sensitive to input errors during the testing phase than during training; DLMs struggled with extreme floods under out-of-sample conditions.

Abstract

Deep learning models (DLMs) have gained attention for estimating daily streamflow, often outperforming traditional process‐based models (PBMs). However, only recently have studies begun comparing the performance of DLMs and PBMs under uncertain data conditions, and the existing work overlooks streamflow measurement errors, considers only a subset of uncertainties, and does not examine how different training schemes affect DLM sensitivity. This study evaluates how sensitive the performance of a PBM (Sacramento, SAC), a DLM (long short‐term memory, LSTM), and a hybrid model (LSTM incorporating SAC outputs) is to three data uncertainties: streamflow measurement error, precipitation input error, and out‐of‐sample (OOS) conditions. Precipitation and synthetic streamflow, estimated by a population PBM, are considered error‐free and used to construct controlled uncertainty scenarios, enabling a systematic comparison of model sensitivities under known‐error conditions. Results show that DLMs are more sensitive than PBMs to measurement errors, making PBMs preferable when streamflow data quality is low. However, DLMs are less sensitive to input errors than PBMs in both gauged and ungauged basins. All models exhibit substantially higher sensitivity to input error during the testing period than during training. DLMs struggle to capture changes in extreme floods under OOS conditions, whereas the PBM captures these events effectively, suggesting its suitability for applications under OOS conditions. Moreover, multi‐basin training can effectively reduce the sensitivity of DLMs to measurement errors. All models demonstrate increased sensitivity in basins characterized by high elevation, low temperature, and dry conditions. This study provides timely and practical insights for selecting appropriate modeling approaches, especially given the growing global use of DLMs.
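The idea of a controlled, known-error scenario can be illustrated with a minimal sketch. It is not the study's procedure: the multiplicative Gaussian error model, the Nash-Sutcliffe efficiency (NSE) as the skill score, the toy flow values, and the function names `nse` and `add_multiplicative_error` are all assumptions made for illustration. The point is only that when the reference series is treated as error-free, the effect of an injected error of known size can be measured directly.

```python
import random


def nse(simulated, observed):
    """Nash-Sutcliffe efficiency; 1.0 means a perfect match (assumed metric)."""
    mean_obs = sum(observed) / len(observed)
    squared_error = sum((s - o) ** 2 for s, o in zip(simulated, observed))
    variance_term = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - squared_error / variance_term


def add_multiplicative_error(series, rel_std, seed=0):
    """Perturb each value by a relative Gaussian error (assumed error model).

    Values are clipped at zero because streamflow cannot be negative.
    """
    rng = random.Random(seed)
    return [max(0.0, x * (1.0 + rng.gauss(0.0, rel_std))) for x in series]


# Hypothetical error-free daily streamflow (arbitrary units), standing in
# for the synthetic series that the study treats as the truth.
clean_flow = [1.0, 1.2, 3.5, 8.0, 4.1, 2.3, 1.6, 1.3]

# Inject errors of increasing, known magnitude and score each corrupted
# series against the error-free reference.
for rel_std in (0.0, 0.1, 0.3):
    noisy_flow = add_multiplicative_error(clean_flow, rel_std)
    print(f"relative error std = {rel_std}: NSE = {nse(noisy_flow, clean_flow):.3f}")
```

In a full experiment along these lines, each model would be trained on the corrupted series and scored against the error-free reference, so that any loss of skill can be attributed to the injected error rather than to unknown data problems.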
