4.7 Article

Relative performance of different data mining techniques for nitrate concentration and load estimation in different type of watersheds

期刊

ENVIRONMENTAL POLLUTION
卷 263, 期 -, 页码 -

出版社

ELSEVIER SCI LTD
DOI: 10.1016/j.envpol.2020.114618

关键词

Data mining; Nitrate concentration; Watershed land use; Water pollution

资金

  1. USDA National Institute of Food and Agriculture, Hatch project [ILLU-741-379]
  2. Natural Science Foundation of China [51809195]
  3. Postdoctoral Science Foundation of China [2018M642083]
  4. National Water Pollution Control and Treatment Science and Technology Major Project of China [2017ZX07204004, 2017ZX07204002]

向作者/读者索取更多资源

The increasing availability of water quality datasets has led to a greater focus on hydrologic and water quality analysis, thus requiring more efficient and accurate modelling methods. Data mining techniques have been increasingly used for water quality analysis and prediction of the concentration and load of nitrogen pollutants instead of more traditional simulation methods. In this study, we tested the multilayer perceptron (MLP), k-nearest neighbor (k-NN), random forest, and reduced error pruning tree (REPTree) methods, along with the traditional linear regression, to predict nitrate levels based on longterm data from six watersheds with different land-use practices in the midwestern United States. Both the concentration and load results indicated that REPTree had the best performance, with an R-2 of 0.61 -0.85 and a relative absolute error of <75.8%. The different watershed types, however, influenced the performance of the data mining methods, where all four methods showed a higher accuracy for urban dominant watershed and lower accuracy for agricultural and forest watersheds. Out of these four methods, classification tree methods (REPTree and RF) performed better than cluster methods (MLP and k-NN) for agricultural and forested watersheds. Our results indicated that both the data structure based on the dominant land use and type of algorithmic method should be carefully considered for selecting a data mining method to predict nitrate concentration and load for a watershed. (c) 2020 Elsevier Ltd. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据