4.7 Article

A Novel Framework Based on Mask R-CNN and Histogram Thresholding for Scalable Segmentation of New and Old Rural Buildings

Journal

REMOTE SENSING
Volume 13, Issue 6, Pages -

Publisher

MDPI
DOI: 10.3390/rs13061070

Keywords

deep learning; rural buildings; instance segmentation; Mask R-CNN; histogram thresholding

Funding

  1. National Natural Science Foundation of China [41971157]
  2. Key R&D Programs in Guangdong Province
  3. Major National Science and Technology Projects of China [2020B0202010002]

Ask authors/readers for more resources

The study introduces a novel framework named HTMask R-CNN that can extract new and old rural buildings even with limited labeled data. By using dynamic grayscale thresholding for classification, the framework achieves higher mean Average Precision compared to traditional methods.
Mapping new and old buildings are of great significance for understanding socio-economic development in rural areas. In recent years, deep neural networks have achieved remarkable building segmentation results in high-resolution remote sensing images. However, the scarce training data and the varying geographical environments have posed challenges for scalable building segmentation. This study proposes a novel framework based on Mask R-CNN, named Histogram Thresholding Mask Region-Based Convolutional Neural Network (HTMask R-CNN), to extract new and old rural buildings even when the label is scarce. The framework adopts the result of single-object instance segmentation from the orthodox Mask R-CNN. Further, it classifies the rural buildings into new and old ones based on a dynamic grayscale threshold inferred from the result of a two-object instance segmentation task where training data is scarce. We found that the framework can extract more buildings and achieve a much higher mean Average Precision (mAP) than the orthodox Mask R-CNN model. We tested the novel framework's performance with increasing training data and found that it converged even when the training samples were limited. This framework's main contribution is to allow scalable segmentation by using significantly fewer training samples than traditional machine learning practices. That makes mapping China's new and old rural buildings viable.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available