4.4 Article

Imbalanced learning of remotely sensed data for bioenergy source identification in a forest in the Wallacea region of Indonesia

Journal

REMOTE SENSING LETTERS
Volume 14, Issue 11, Pages 1119-1130

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/2150704X.2023.2270107

Keywords

Aerial surveillance; Urban forestry; Remote monitoring; Class imbalanced; Object detection; Machine learning

Ask authors/readers for more resources

Remote sensing technologies are crucial for supporting ecological strategies of policy-makers. This study contributes to the field by providing an aerial dataset and developing an intelligent tree-detection system for counting trees with bioenergy potential. However, imbalanced class distribution in the dataset can impact model performance. The study proposes data-level approaches to address this issue and improves the training process and object detection metrics.
Remote sensing technologies have been increasingly crucial to support policy-makers in achieving their ecological strategies. The data provided by such technology can estimate the bioenergy source production rate and monitor deforestation. This work participates in the cause by contributing an aerial dataset and developing an intelligent tree-detection system usable for counting trees with the bioenergy potential. Low-altitude flying units have been vastly used for such a purpose due to their ability to capture high-quality data from distant locations. Despite these potentials, collected images that compose a dataset are often characterized by imbalanced distribution among classes. The class disproportion can affect the overall model performance, as it severely deprives key features of under-represented classes. This study proposes data-level approaches that adopt and extend prior sampling algorithms for object detection problems. The devised techniques try to reduce the number of redundant outputs obtained from sampling methods and reduce the iteration required to achieve the target imbalance ratio by employing a systematic flow. In such a process, the class distribution of an original dataset is used as a guideline for selecting candidates for subsequent processes. Our results show that the modified dataset can reduce the length of a training process shown by fewer iterations required to achieve the final metrics of its original dataset version and lower training losses in each iteration. Additionally, the modified dataset can improve the F-score ( F 1 ) and precision metric of object detection algorithm by up to 6%.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available