4.7 Article

Predicting Daily River Chlorophyll Concentrations at a Continental Scale

期刊

WATER RESOURCES RESEARCH
卷 59, 期 11, 页码 -

出版社

AMER GEOPHYSICAL UNION
DOI: 10.1029/2022WR034215

关键词

chlorophyll; rivers; machine learning

向作者/读者索取更多资源

Eutrophication is a major threat to aquatic ecosystems, and predicting chlorophyll a concentrations can help assess the trophic state and algal abundance. In this study, a large dataset of chlorophyll a concentrations from 82 streams and rivers across the United States was compiled, and a machine learning algorithm was used to predict daily chlorophyll a concentrations. The model showed strong correlations with observed data, but had lower accuracy when applied to completely new sites. Turbidity and total nitrogen were identified as the most important variables for predicting chlorophyll a.
Eutrophication is one of the largest threats to aquatic ecosystems and chlorophyll a measurements are relevant indicators of trophic state and algal abundance. Many studies have modeled chlorophyll a in rivers but model development and testing has largely occurred at individual sites which hampers creating generalized models capable of making broad-scale predictions. To address this gap, we compiled a large data set of chlorophyll a concentrations matched to other water quality, meteorological, and reach characteristic data for a diverse set of 82 streams and rivers across the United States. We used this data set and extreme gradient boosting, a tree-based machine learning algorithm, to predict daily chlorophyll a concentrations. Furthermore, we tested several practical considerations of broad-scale models, such as making predictions at sites not included in model training or the utility of in situ water quality data versus universally available remotely estimated model inputs. Predictions were very strongly correlated to observations when compared against a randomly withheld subset of days; however, the model had lower accuracy when applied to completely novel sites withheld from model training. Turbidity and total nitrogen were the two most important variables for predicting chlorophyll a. Although in situ variables improved modeled estimates and were identified as more important during model interpretation, using only remote inputs still resulted in highly correlated predictions with small bias. Testing a model across many sites allowed for identification of common variables relevant to chlorophyll a and highlighted several challenges for applying data-driven models to new sites or at larger spatial scales.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据