4.7 Article

Comparison of common classification strategies for large-scale vegetation mapping over the Google Earth Engine platform

Publisher

ELSEVIER
DOI: 10.1016/j.jag.2022.103092

Keywords

Crop mapping; Vegetation; Agriculture; Machine learning; Sentinel; Google Earth Engine

Categories

Funding

  1. National Natural Science Foundation of China
  2. Fudan Tyndall Centre of Fudan University
  3. [31961143006]
  4. [71774033]
  5. [IDH6286315]

Ask authors/readers for more resources

Vegetation resources play a crucial role in sustainable development, with monitoring land use and cover being key to their sustainable management. Earth Observation satellites have provided a powerful platform for this task. A classification framework based on random forests and Sentinel data was developed, revealing that certain strategies such as pixel time series features and class-balanced labels can enhance performance but may also lead to tradeoffs between recall and precision.
Vegetation resources have an essential role in sustainable development due to their close relationship with natural resource management and environmental protection. The monitoring of land use and cover is key for a more sustainable management of these resources, and Earth Observation satellites have provided an increasingly powerful platform for performing this task. To date, numerous classification algorithms have been developed for vegetation mapping, but a comparative evaluation of the strategies commonly used in large-scale applications (50000 km2 and above) is lacking. We developed a classification framework based on random forests and Sentinel data within the Google Earth Engine platform to assess the performance of various strategies over a complex landscape with a wide range of vegetation classes, plot configurations, and agricultural practices. These strategies differed in key aspects related to the characteristics of the classifier and the sample, and were evaluated by class-specific and general accuracy metrics. We found that the use of pixel time series features from fusion data, class-balanced labels, and multi-season time frames enhanced overall performance by 1.3%-14.1% over alternative approaches, but in some cases also generated tradeoffs of 1.6%-8.5% between recall and precision. In general, suboptimal strategies were particularly ineffective for the detection of infrequent classes. Finally, the different parameter values used in the random forests did not have a significant influence over the results. Our results demonstrate the importance of algorithm design in the effective classification of multiple vegetation classes, corroborate the usefulness of Sentinel data to generate mapping products of high resolution and accuracy, and highlight the importance of cloud computing tools for the development of vegetation mapping tools in large-scale applications. These findings provide general guidelines for the design of future classification frameworks, which are necessary for facilitating a sustainable management of natural resources worldwide.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available