Benchmarking a Catchment-Aware Long Short-Term Memory Network (LSTM) for Large-Scale Hydrological Modeling

Abstract. Regional rainfall-runoff modeling is an old but still mostly out-standing problem in Hydrological Sciences. The problem currently is that traditional hydrological models degrade significantly in performance when calibrated for multiple basins together instead of for a single basin alone. In this paper, we propose a novel, data-driven approach using Long Short-Term Memory networks (LSTMs), and demonstrate that under a big data paradigm, this is not necessarily the case. By training a single LSTM model on 531 basins from the CAMELS data set using meteorological time series data and static catchment attributes, we were able to significantly improve performance compared to a set of several different hydrological benchmark models. Our proposed approach not only significantly outperforms hydrological models that were calibrated regionally but also achieves better performance than hydrological models that were calibrated for each basin individually. Furthermore, we propose an adaption to the standard LSTM architecture, which we call an Entity-Aware-LSTM (EA-LSTM), that allows for learning, and embedding as a feature layer in a deep learning model, catchment similarities. We show that this learned catchment similarity corresponds well with what we would expect from prior hydrological understanding.

[1]  Martyn P. Clark,et al.  Benchmarking of a Physically Based Hydrologic Model , 2017 .

[2]  Yuqiong Liu,et al.  Reconciling theory with observations: elements of a diagnostic approach to model evaluation , 2008 .

[3]  Vijay P. Singh,et al.  The NWS River Forecast System - catchment modeling. , 1995 .

[4]  Hoshin Vijai Gupta,et al.  Large-sample hydrology: a need to balance depth with breadth , 2013 .

[5]  G. SCALE ISSUES IN HYDROLOGICAL MODELLING : A REVIEW , 2006 .

[6]  Luis Samaniego,et al.  Towards seamless large‐domain parameter estimation for hydrologic models , 2017 .

[7]  Hoshin Vijai Gupta,et al.  A process‐based diagnostic approach to model evaluation: Application to the NWS distributed hydrologic model , 2008 .

[8]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[9]  Wes McKinney,et al.  Data Structures for Statistical Computing in Python , 2010, SciPy.

[10]  Martyn P. Clark,et al.  Framework for Understanding Structural Errors (FUSE): A modular framework to diagnose differences between hydrological models , 2008 .

[11]  Sepp Hochreiter,et al.  Do internals of neural networks make sense in the context of hydrology , 2018 .

[12]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction , 2018, ArXiv.

[13]  Max D. Morris,et al.  Factorial sampling plans for preliminary computational experiments , 1991 .

[14]  Keith Beven,et al.  Equifinality, data assimilation, and uncertainty estimation in mechanistic modelling of complex environmental systems using the GLUE methodology , 2001 .

[15]  Luis Samaniego,et al.  Diagnostic Evaluation of Large‐Domain Hydrologic Models Calibrated Across the Contiguous United States , 2019, Journal of Geophysical Research: Atmospheres.

[16]  Tim R. McVicar,et al.  Global‐scale regionalization of hydrologic model parameters , 2016 .

[17]  Arun Kumar,et al.  Long‐range experimental hydrologic forecasting for the eastern United States , 2002 .

[18]  Luis Samaniego,et al.  Scaling, Similarity, and the Fourth Paradigm for Hydrology , 2017, Hydrology and earth system sciences.

[19]  Eric A. Anderson,et al.  National Weather Service river forecast system: snow accumulation and ablation model , 1973 .

[20]  Hoshin Vijai Gupta,et al.  Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling , 2009 .

[21]  C. Perrin,et al.  Does a large number of parameters enhance model performance? Comparative assessment of common catchment model structures on 429 catchments , 2001 .

[22]  Martyn P. Clark,et al.  Development of a large-sample watershed-scale hydrometeorological data set for the contiguous USA: data set characteristics and assessment of regional variability in hydrologic model performance , 2014 .

[23]  Soroosh Sorooshian,et al.  Toward improved calibration of hydrologic models: Multiple and noncommensurable measures of information , 1998 .

[24]  Jan Seibert,et al.  Regionalisation of parameters for a conceptual rainfall-runoff model , 1999 .

[25]  J. McDonnell,et al.  A decade of Predictions in Ungauged Basins (PUB)—a review , 2013 .

[26]  Dmitri Kavetski,et al.  Estimating mountain basin‐mean precipitation from streamflow using Bayesian inference , 2015 .

[27]  Donna M. Rizzo,et al.  Advances in ungauged streamflow prediction using artificial neural networks , 2010 .

[28]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[29]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[30]  Anqi Wang,et al.  Practical Experience of Sensitivity Analysis: Comparing Six Methods, on Three Hydrological Models, with Three Performance Criteria , 2019, Water.

[31]  Paulin Coulibaly,et al.  Streamflow Prediction in Ungauged Basins: Review of Regionalization Methods , 2013 .

[32]  S. Attinger,et al.  Multiscale parameter regionalization of a grid‐based hydrologic model at the mesoscale , 2010 .

[33]  Jan Seibert,et al.  Teaching hydrological modeling with a user-friendly catchment-runoff-model software package , 2012 .

[34]  Linda L Kloss Framework for Understanding HIM in Managed Care , 1998 .

[35]  P. E. O'connell,et al.  IAHS Decade on Predictions in Ungauged Basins (PUB), 2003–2012: Shaping an exciting future for the hydrological sciences , 2003 .

[36]  Sepp Hochreiter,et al.  A glimpse into the Unobserved: Runoff simulation for ungauged catchments with LSTMs , 2018 .

[37]  Karsten Schulz,et al.  Rainfall–runoff modelling using Long Short-Term Memory (LSTM) networks , 2018, Hydrology and Earth System Sciences.

[38]  D. Lettenmaier,et al.  A simple hydrologically based model of land surface water and energy fluxes for general circulation models , 1994 .

[39]  F. Naef,et al.  Can we model the rainfall-runoff process today? , 1981 .

[40]  A. Jakeman,et al.  How much complexity is warranted in a rainfall‐runoff model? , 1993 .

[41]  J. Nash,et al.  River flow forecasting through conceptual models part I — A discussion of principles☆ , 1970 .

[42]  Martyn P. Clark,et al.  The CAMELS data set: catchment attributes and meteorology for large-sample studies , 2017 .

[43]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[44]  Martyn P. Clark,et al.  On the choice of calibration metrics for “high-flow” estimation using hydrologic models , 2019, Hydrology and Earth System Sciences.

[45]  Patrick M. Reed,et al.  Technical Note: Method of Morris effectively reduces the computational demands of global sensitivity analysis for distributed watershed models , 2013 .

[46]  Gaël Varoquaux,et al.  The NumPy Array: A Structure for Efficient Numerical Computation , 2011, Computing in Science & Engineering.

[47]  Jan Seibert,et al.  Upper and lower benchmarks in hydrological modelling , 2018 .

[48]  Dmitri Kavetski,et al.  Flow Prediction in Ungauged Catchments Using Probabilistic Random Forests Regionalization and New Statistical Adequacy Tests , 2019, Water Resources Research.

[49]  J. Kirchner Getting the right answers for the right reasons: Linking measurements, analyses, and models to advance the science of hydrology , 2006 .

[50]  Stefano Tarantola,et al.  Sensitivity Analysis in Practice: A Guide to Assessing Scientific Models , 2004 .

[51]  Sepp Hochreiter,et al.  NeuralHydrology - Interpreting LSTMs in Hydrology , 2019, Explainable AI.

[52]  Sabine Attinger,et al.  Implications of distributed hydrologic model parameterization on water fluxes at multiple scales and locations , 2013 .