MACH-Flow

Machine learning for Swiss river flow estimation

Started
September 1, 2021
Status
Completed
Share this project

Abstract

River flow is a key component of the terrestrial water cycle and of special relevance for ecosystems and societies. Dependable information on river flow is essential for advancing both environmental research, and water resources management as well as for anticipating and mitigating floods and droughts.

To date, observations at gauging stations constitute the most accurate way to measure this quantity along river systems. However, observation networks only monitor a small number of locations, leaving many gaps on maps of regional river flow.

The proposed project MACH-Flow aims to bridge this gap by advancing our capabilities for estimating daily river flow at ungauged locations in Switzerland using data science methods. To this end, MACH-Flow will expand upon recent advances for modeling water-cycle variables by fusing sparse in situ observations with spatially continuous predictor variables using machine learning.

A special focus will be on making existing methods fit for application at the national scale, by tackling conceptual hurdles that arise when modeling daily river flow at the very high spatial resolution (i.e. 200×200 to 500×500 meters) that is relevant for water management. In particular, the MACH-Flow project will develop a machine-learning based reconstruction of spatially resolved daily river flow covering all of Switzerland. This product will be the first of its type and be relevant for a range of practical and scientific applications.

People

Collaborators

SDSC Team:
William Aeberhard

PI | Partners:

ETH Zurich, Institute for Atmospheric and Climate Science:

  • Dr. Lukas Gudmundsson
  • Dr. Basil Kraft
  • Dr. Etienne Fluet-Chouinard
  • Prof. Sonia I. Seneviratne

More info

WSL, Hydrological Forecasts Group:

  • Dr. Massimiliano Zappa
  • Dr. Michael Schirmer

More info

description

Motivation

Accurately predicting river discharge, in particular at ungauged locations, is crucial for water resource management as well as drought and flood monitoring. Traditional hydrological models are generally calibrated on data from a single river catchment. They perform quite well when forecasting at this particular location (satisfactory temporal prediction) but they often generalize poorly at other locations (unsatisfactory spatial prediction). The main goal of this project was to develop models which can predict in space as well as in time. To this end we made use of data from many river gauging stations within a river network and modeled them jointly.

Proposed Approach / Solution

We developed two models in parallel. The first one is a recurrent neural network with long short-term memory (LSTM) cells, following the hydrological state-of-the-art architectures. Here, all gauging stations are considered independent and they share parameters (Figure 1). An extension we implemented in a follow-up paper involved a loss function based on a generalization of the continuous ranked probability score (negative energy score, tailored for multivariate probabilistic forecasts). This allowed to build ensemble predictions which quantify predictive uncertainty in a reliable way. The second model builds on first principles with an additive model with ridge-penalized cubic B-splines. This model includes engineered features for both short- and long-term effects of precipitation and temperature. We developed an additional independent module which can incorporated to this additive model. This module routes water through a river network, by encoding the latter as a directed graph. We have illustrated with a benchmark hydrological model that such spatio-temporal routing can only improve predictions, and this mainly for more downstream locations (Figure 2). A future research avenue, beyond MACH-Flow, is to combine this water routing module (as graph-based convolutions) with a recurrent neural network.

Impact

First, our LSTM neural network is on par with state-of-the-art hydrological models for Switzerland, albeit with much less inputs. This allowed us to reconstruct river discharge at a national scale back to the early 1960s, which is the earliest time for which precipitation and temperature data products are available. This reconstruction is important for the hydrological community as its low computation cost allows one to test various climate change scenarios. Second, the routing model, released as an independent module, is useful for the community as it can be easily added to any local prediction model to improve downstream predictions. Compared to other routing schemes in the hydrological literature, ours require minimal data inputs and accommodates any space discretization (regular grids or others). Also, it can improve predictions even with discharge data only available at a single station. Finally, a version of our additive model has been added to the ensemble of models used operationally on the drought monitoring platform drought.ch maintained by our partners at WSL.

Figure 1: Figure 7 from Kraft et al. (2024), spatially contiguous reconstruction of runoff from 1962 to 2023. The maps represent the yearly catchment-level runoff quantiles relative to the reference period (1971 to 2000) empirical distribution. The bottom bars show the national decadal deviation in mm/year of the national-level runoff relative to the reference period (1971 to 2000).
Figure 2: Routed predicted discharge in m^3/s on November 25, 2012 (day with minimum average discharge over our training time window). The routing module here is applied on the (unrouted) local predictions from the benchmark model PREVAH (Viviroli et al., 2009).

Gallery

Annexe

Additional resources

Bibliography

  1. Ghiggi, G., Humphrey, V., Seneviratne, S. I., and Gudmundsson, L. (2021). G-RUN ENSEMBLE: A multi-forcing observation-based global runoff reanalysis. Water Resources Research, 57 (5), 1-13. https://doi.org/10.1029/2020WR028787
  2. Jia, X. et al. (2021). Physics-Guided Recurrent Graph Model for Predicting Flow and Temperature in River Networks. Proceedings of the 2021 SIAM International Conference on Data Mining (SDM), 612-620. The Society for Industrial and Applied Mathematics: https://epubs.siam.org/doi/10.1137/1.9781611976700.69
  3. Asadi, P., Davison, A. C., and Engelke, S. (2015). Extremes on river networks. Ann. Appl. Stat. 9 (4), 2023-2050. Extremes on river networks: https://projecteuclid.org/journals/annals-of-applied-statistics/volume-9/issue-4/Extremes-on-river-networks/10.1214/15-AOAS863.full

Publications

Kraft, B.; Stalder, S.; Aeberhard, W. H.; Harrington Ruiz, N.; Meinshausen, N.; Shen, X.; Gudmundsson, L. "Modeling uncertainty with engression: a deep generative time-series approach" Submitted to Geophysical Research Letters 2025 View publication
Kraft, B.; Schirmer, M.; Aeberhard, W. H.; Zappa, M.; Seneviratne, S. I.; Gudmundsson, L. "CH-RUN: a deep-learning-based spatially contiguous runoff reconstruction for Switzerland" Hydrology and Earth System Sciences 29 4 1061-1082 2025 View publication

Related Pages

More projects

CHUV: Heracles - Sepsis Model

Completed
CHUV: Improving Sepsis detection and quality of care with AI
Health & Biomedical
Public sector

AI-Driven Political Monitoring

Completed
Legislative tracking for labor advocacy at Kaufmännischer Verband Schweiz
Digital Society
Private sector

LUCID National Data Stream

In Progress
Low Value of Care in Medical Hospitalized Patients - a National Data Stream on Quality of Care in Swiss University Hospitals
Health & Biomedical

Syngenta: Steam consumption optimization

Completed
Reliable strategies to save energy in Syngenta’s Kaisten plant
Energy & Sustainability
Private sector

News

Latest news

Coding the Future: Energy Data Hackdays Expand to French-speaking Switzerland
May 7, 2026

Coding the Future: Energy Data Hackdays Expand to French-speaking Switzerland

Coding the Future: Energy Data Hackdays Expand to French-speaking Switzerland

Held at the SDSC headquarters at Biopôle, the Energy Data Hackdays gather 100 experts to tackle 5 energy and grid challenges.
Science des données : le SDSC et le Canton de Vaud soutiennent quatre projets appliqués
April 30, 2026

Science des données : le SDSC et le Canton de Vaud soutiennent quatre projets appliqués

Science des données : le SDSC et le Canton de Vaud soutiennent quatre projets appliqués

Le SDSC et le Canton de Vaud ont retenu quatre projets parmi les 57 soumissions reçues lors de leur deuxième appel à projets.
Le Swiss Data Science Center inaugure son siège au Biopôle de Lausanne
March 12, 2026

Le Swiss Data Science Center inaugure son siège au Biopôle de Lausanne

Le Swiss Data Science Center inaugure son siège au Biopôle de Lausanne

Le SDSC inaugure aujourd'hui son siège au campus Biopôle de Lausanne, dans le cadre d'un partenariat stratégique avec l'État de Vaud.

Contact us

Let’s talk Data Science

Do you need our services or expertise?
Contact us for your next Data Science project!