4.7 Article

Predictive mapping of aquatic ecosystems by means of support vector machines and random forests

Journal

JOURNAL OF HYDROLOGY
Volume 595, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.jhydrol.2021.126026

Keywords

Machine learning; Freshwater; Ecosystem services; Big data; Supervised classification

Funding

  1. Ministerio de Ciencia, Innovacion y Universidades [RTI2018-099394-B-I00]
  2. Spain's Ministerio de Educacion, Cultura y Deporte [PRX18/00235]

Ask authors/readers for more resources

This study utilized machine learning to predict water bodies in a western Colombian catchment, demonstrating better performance in flat regions and decreased accuracy in mountainous areas.
In a context of rapid change, mapping aquatic ecosystems is crucial for their protection, particularly in regions of the world where a comprehensive inventory of water bodies is yet to be developed. This paper presents a machine learning methodology for predictive aquatic ecosystem mapping. Two state-of-the-art classifiers were used, namely support vector machines (SVM) and random forests (RFC). These were implemented in a GIS environment and trained on a sample of 350 points of a western Colombian catchment, where the presence or absence or water was known beforehand. Sixty percent of the dataset was used to train the algorithms and the remainder was used to calibrate predictive accuracy. Both classifiers rendered a realistic picture of the catchment's water bodies. RFC obtained a mean test score in excess of 0.93 and an area under the receiver operating characteristic curve of 0.95. SVM scored 0.89 and 0.89, respectively, after optimization. Performance was better overall in flat regions. SVM identified correctly over 87% of the surface area of lentic and 82% of lotic water bodies, while RFC scored over 80% in both cases. In mountain regions precision dropped to 69% and 62% for SVM and 65% and 76% for RFC. This leads to three conclusions: (a) the method provides satisfactory results at the regional scale and is versatile enough to export to other settings; (b) local-scale detail cannot be captured in the absence of an accurate high-resolution digital elevation model, particularly in mountain areas; (c) complementing standard machine learning metrics with ad hoc indicators is likely needed in cases where the input dataset consists solely of unambiguous examples.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available