☆ 4.7 Article

Improving land cover classification in an urbanized coastal area by random forests: The role of variable selection

REMOTE SENSING OF ENVIRONMENT (2020)

Journal

REMOTE SENSING OF ENVIRONMENT

Volume 251, Issue -, Pages -

Publisher

ELSEVIER SCIENCE INC

DOI: 10.1016/j.rse.2020.112105

Keywords

Land cover classification; Complex environments; Variable selection; Random forests

Funding

Florida State University (FSU)
Jinan University
Guangdong Province of China
FSU Department of Geography

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Land cover mapping in complex environments can be challenging due to their landscape heterogeneity. With the increasing availability of various open-access remotely sensed datasets, more images acquired by different sensors and on different dates tend to be used to improve land cover classification accuracy. Selecting an appropriate feature domain with the best landscape separability is therefore crucial in meeting the requirement of computational efficiency and model interpretability. Variable selection is widely used in pattern recognition to enhance model parsimony. This study focused on the variable selection process and proposed a series of methods to select the optimal feature domain to improve land cover classification in a complex urbanized coastal area. Two decision tree models (CART-Classification and Regression Tree and CIT-Conditional Inference Tree) and five variable importance measures (GINI, PVIM-Permutated Variable Importance Measure, MDMinimum Depth, IPM-Intervention of Prediction Measure, and CPVIM-Conditional Permutation Variable Importance Measure) based on random forests were considered. Variable importance measures were applied to a set of spectral, spatial and temporal features derived from medium-resolution satellite images. Backward elimination methods were used to select the optimal feature subset. It is found that compared to the traditional band-only model, the variable selection process can significantly improve the model parsimony and computational efficiency. The CPVIM based on CIT decision tree model was more reliable in selecting relevant features regardless their correlations, but CART tended to generate higher classification accuracy. Therefore, the combination of the CART model and the ranking from the CPVIM variable measure is recommended to achieve higher classification accuracy and better data interpretability. The novelty of our work is with the insight into the merits of integrating variable selection in the land cover classification process over complex environments.

Improving land cover classification in an urbanized coastal area by random forests: The role of variable selection

Journal

REMOTE SENSING OF ENVIRONMENT

Publisher

ELSEVIER SCIENCE INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Improving land cover classification in an urbanized coastal area by random forests: The role of variable selection

Journal

REMOTE SENSING OF ENVIRONMENT

Publisher

ELSEVIER SCIENCE INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper