4.7 Article

Predictive mapping of aquatic ecosystems by means of support vector machines and random forests

期刊

JOURNAL OF HYDROLOGY
卷 595, 期 -, 页码 -

出版社

ELSEVIER
DOI: 10.1016/j.jhydrol.2021.126026

关键词

Machine learning; Freshwater; Ecosystem services; Big data; Supervised classification

资金

  1. Ministerio de Ciencia, Innovacion y Universidades [RTI2018-099394-B-I00]
  2. Spain's Ministerio de Educacion, Cultura y Deporte [PRX18/00235]

向作者/读者索取更多资源

This study utilized machine learning to predict water bodies in a western Colombian catchment, demonstrating better performance in flat regions and decreased accuracy in mountainous areas.
In a context of rapid change, mapping aquatic ecosystems is crucial for their protection, particularly in regions of the world where a comprehensive inventory of water bodies is yet to be developed. This paper presents a machine learning methodology for predictive aquatic ecosystem mapping. Two state-of-the-art classifiers were used, namely support vector machines (SVM) and random forests (RFC). These were implemented in a GIS environment and trained on a sample of 350 points of a western Colombian catchment, where the presence or absence or water was known beforehand. Sixty percent of the dataset was used to train the algorithms and the remainder was used to calibrate predictive accuracy. Both classifiers rendered a realistic picture of the catchment's water bodies. RFC obtained a mean test score in excess of 0.93 and an area under the receiver operating characteristic curve of 0.95. SVM scored 0.89 and 0.89, respectively, after optimization. Performance was better overall in flat regions. SVM identified correctly over 87% of the surface area of lentic and 82% of lotic water bodies, while RFC scored over 80% in both cases. In mountain regions precision dropped to 69% and 62% for SVM and 65% and 76% for RFC. This leads to three conclusions: (a) the method provides satisfactory results at the regional scale and is versatile enough to export to other settings; (b) local-scale detail cannot be captured in the absence of an accurate high-resolution digital elevation model, particularly in mountain areas; (c) complementing standard machine learning metrics with ad hoc indicators is likely needed in cases where the input dataset consists solely of unambiguous examples.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据