Article

Utilizing unsupervised learning, multi-view imaging, and CNN-based attention facilitates cost-effective wetland mapping

REMOTE SENSING OF ENVIRONMENT (2021)

Journal

REMOTE SENSING OF ENVIRONMENT

Volume 267

Publisher

ELSEVIER SCIENCE INC

DOI: 10.1016/j.rse.2021.112757

Keywords

Wetland mapping; Semantic segmentation; Multiscale CNN; Attention mechanism; Multi-view; UAV; Feature selection; Automation; Network pruning; Deep learning

Funding

U.S. Department of Agriculture-Natural Resources Conservation Service
U.S. Environmental Protection Agency
Daugherty Water for Food Global Institute at the University of Nebraska
U.S. Fish and Wildlife Service
Rainwater Basin Joint Venture
Nebraska Game and Parks Commission

Automated Summary

Combining UAV data with deep learning, especially CNNs, provides robust tools for precision land cover mapping, but success depends on rarely documented local experience, so cost-effective frameworks are needed. The Auto-UNet++ framework streamlines wetland mapping tasks with automatic strategies, achieving high accuracy while reducing human intervention.

Abstract

The combination of Unmanned/Unoccupied Aerial Vehicle (UAV) data and deep learning, especially convolutional neural networks (CNNs), offers robust new tools for precision land cover mapping. However, its successful application depends heavily on local experience that is rarely documented, which limits practical implementation. Cost-effective deep learning frameworks for fast deployment are therefore required. This study presents a deep learning adaptation framework, named Auto-UNet++, that aims to streamline wetland mapping tasks (including training data labeling and organizing). The framework treats a mapping task as a complete semantic segmentation pipeline and integrates automatic strategies into each step to reduce human intervention. These strategies rely on standard computer vision techniques: multi-view (MV) imaging, i.e., highly overlapped UAV images over an area (for labeling/voting); unsupervised clustering (for labeling); a multi-scale CNN (for feature extraction); and an attention mechanism, i.e., a CNN design used to select informative features from the input (for feature exploration/selection). The framework was tested on playa wetland mapping in the Rainwater Basin, Nebraska, USA, with multispectral UAV datasets. Overall, the multi-scale CNN mapping task achieved up to 87% overall accuracy and over 90% accuracy in water delineation. The results indicate that the multi-view and attention strategies have the potential to improve segmentation performance and, together with unsupervised learning, save considerable labor and expertise. Interestingly, evidence shows that the band/scale attention (weight) is adaptively associated with the land cover percentages per input image, indicating that spatial context is captured. This finding highlights the potential uses of the attention rule in automatic feature exploration, selection, and model interpretation.
By illustrating a highly automated deep learning deployment on small MV datasets, the framework facilitates cost-effective wetland cover mapping. Although limitations exist, the study demonstrates where and how conventional segmentation pipelines can be improved in typical UAV wetland mapping tasks. The framework and findings are useful for similar applications (including non-UAV studies) that have only limited time, labor, and expertise to implement sophisticated semantic segmentation models.
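
The abstract mentions using multi-view imagery "for labeling/voting" but does not describe the voting procedure. As an illustration only, the following is a minimal sketch of one plausible reading: a per-pixel majority vote over label maps from overlapping views. It assumes the views have already been co-registered to a common grid; the function name `vote_labels`, the `NODATA` marker, the tie-breaking rule, and the example class ids are all hypothetical and are not taken from the paper.

```python
from collections import Counter

NODATA = -1  # hypothetical marker for pixels that a view does not cover


def vote_labels(views):
    """Fuse co-registered per-view label grids by per-pixel majority vote.

    `views` is a list of equally sized 2-D lists of integer class labels.
    Ties are broken in favor of the smallest class id (an arbitrary choice).
    Pixels covered by no view stay NODATA.
    """
    rows, cols = len(views[0]), len(views[0][0])
    fused = [[NODATA] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            # Count the labels proposed for this pixel, skipping uncovered views.
            votes = Counter(v[r][c] for v in views if v[r][c] != NODATA)
            if votes:
                top = max(votes.values())
                fused[r][c] = min(k for k, n in votes.items() if n == top)
    return fused


# Three hypothetical views of the same 2x3 ground patch (0 = upland, 1 = water).
views = [
    [[1, 1, 0], [0, 0, -1]],
    [[1, 0, 0], [0, 1, -1]],
    [[1, 1, 0], [1, 1, -1]],
]
fused = vote_labels(views)
print(fused)
```

In this toy example, a pixel labeled as water in two of three views is fused as water, and the pixel that no view covers remains unlabeled. A real pipeline would also need the geometric step that maps each overlapping image onto a shared grid, which this sketch leaves out.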
