4.7 Article

Utilizing unsupervised learning, multi-view imaging, and CNN-based attention facilitates cost-effective wetland mapping

Journal

REMOTE SENSING OF ENVIRONMENT
Volume 267, Issue -, Pages -

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.rse.2021.112757

Keywords

Wetland mapping; Semantic segmentation; Multiscale CNN; Attention mechanism; Multi-view; UAV; Feature selection; Automation; Network pruning; Deep learning

Funding

  1. U.S. Department of Agriculture-Natural Resources Conservation Service
  2. U.S. Environ-mental Protection Agency
  3. Daugherty Water for Food Global Institute at the University of Nebraska
  4. U.S. Fish and Wildlife Service
  5. US Department of Agriculture-Natural Resources Conservation Service
  6. Rainwater Basin Joint Venture
  7. Nebraska Game and Parks Commission

Ask authors/readers for more resources

The combination of UAV data and deep learning, especially CNNs, provides robust tools for precision land cover mapping, although success relies on local experiences and cost-effective frameworks are needed. The Auto-UNet++ framework streamlines wetland mapping tasks with automatic strategies, achieving high accuracy and reducing human intervention.
The combination of Unmanned/Unoccupied Aerial Vehicle (UAV) data and deep learning, especially convolutional neural networks (CNNs), offers robust new tools for precision land cover mapping. However, its successful application is highly dependent on local experiences that are rarely documented, resulting in practical limitations during implementation. Cost-effective deep learning frameworks for fast deployment are required. This study presents a deep learning adaptation framework, named Auto-UNet++, trying to streamline wetland mapping tasks (including training data labeling and organizing). The framework treats mapping tasks as an intact semantic segmentation pipeline and then integrates automatic strategies into each step to reduce human intervention. These automatic strategies are achieved by standard computer vision techniques, including multi-view (MV) imaging-highly overlapped UAV images over an area (for labeling/voting), unsupervised clustering (for labeling), multi-scale CNN (for feature extraction), and attention mechanism-a CNN design used to select informative features from input (for feature exploration/selection). The framework was tested on playa wetland mapping in the Rainwater Basin, Nebraska, USA, with multispectral UAV datasets. Generally, the multi-scale CNN mapping task achieved a high of 87% overall accuracy and over 90% accuracy in water delineation. The results indicate that the multi-view and attention strategies have the potential to improve segmentation performance, and together with unsupervised learning, save considerable labor/expertise. Interestingly, evidence shows that the band/scale attention (weight) is adaptively associated with the land cover percentages per input image, indicating spatial contexts captured. This finding highlights the potential usages of the attention rule in automatic feature exploration, selection, and model interpretation. The framework illustrating a highly automated deep learning deployment on small MV datasets facilitates cost-effective wetland cover mapping. Although limitations exist, the study demonstrated the possibility of where/how conventional segmentation pipelines can be improved in typical UAV wetland mapping tasks. The framework and findings are useful for similar applications (including non-UAV studies) that only have limited time, labor, and expertise to implement sophisticated semantic segmentation models.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available