4.6 Article

Using machine learning to improve predictions and provide insight into fluvial sediment transport

Journal

HYDROLOGICAL PROCESSES
Volume 36, Issue 8, Pages -

Publisher

WILEY
DOI: 10.1002/hyp.14648

Keywords

bedload; hysteresis; machine learning; sediment rating curve; sediment transport; suspended sediment

Funding

  1. Minnesota Agricultural Research Station [MIN-42-080]
  2. Minnesota Department of Natural Resources
  3. Minnesota Pollution Control Agency
  4. Minnesota's Clean Water Fund
  5. Peter F. Ffolliott Fellowship in the Department of Forest Resources, University of Minnesota
  6. U.S. Geological Survey Cooperative Matching Funds
  7. University of Minnesota
  8. USGS Cooperative Agreement [G20AC00427]

Ask authors/readers for more resources

A thorough understanding of fluvial sediment transport is crucial for addressing environmental concerns. This study developed and trained machine learning models to predict suspended sediment and bedload, using sampling data collected in Minnesota, US. The models successfully explained approximately 70% of the variance in the datasets, with important features identified using SHAP plots. This study is a promising step forward in predicting fluvial sediment transport using machine learning models, with applications in landscape and water management.
A thorough understanding of fluvial sediment transport is critical to addressing many environmental concerns such as exacerbated flooding, degradation of aquatic habitat, excess nutrients, and the economic challenges of restoring aquatic systems. Fluvial sediment samples are integral for addressing these environmental concerns but cannot be collected at every river and time of interest. Therefore, to gain a better understanding for rivers where direct measurements have not been made, extreme gradient boosting machine learning (ML) models were developed and trained to predict suspended sediment and bedload from sampling data collected in Minnesota, United States (U.S.), by the U.S. Geological Survey. Approximately 400 watershed (full upstream area), catchment (nearby landscape), near-channel, channel, and streamflow features were retrieved or developed from multiple sources, reduced to approximately 30 uncorrelated features, and used in the final ML models. The results indicate suspended sediment and bedload ML models explain approximately 70% of the variance in the datasets. Important features used in the models were interpreted with Shapley additive explanation (SHAP) plots, which provided insight into sediment transport processes. The most important features in the models were developed to normalize streamflow by the 2-year recurrence interval and quantify the rate of change in streamflow (slope), which helped account for sediment hysteresis. Generally, this study also showed a combination of mostly watershed and catchment geospatial features were important in ML models that predict sediment transport from physical samples. This study is a promising step forward in making fluvial sediment transport predictions using machine learning models trained by physically collected samples. The approach developed here can be used wherever similar datasets exists and will be useful for landscape and water management.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available