4.6 Article

A non-invasive learning branch to capture leaf-image attention for tree species classification

Journal

MULTIMEDIA TOOLS AND APPLICATIONS
Volume 81, Issue 10, Pages 13961-13978

Publisher

SPRINGER
DOI: 10.1007/s11042-022-12036-6

Keywords

Computer vision; Image classification; Attention mechanism; Tree species

Funding

  1. National Natural Science Foundation of China [62072348]
  2. Science and Technology Major Project of Hubei Province (Next-Generation AI Technologies) [2019AEA170]

Ask authors/readers for more resources

In this paper, a non-invasive attention learning branch structure is proposed to enhance the discrimination capacity of convolutional neural networks (CNNs) for complex tree species classification. By adding an attention learning branch to the typical convolutional blocks, key features of leaf images can be extracted and integrated into the classification process, resulting in remarkable improvement in performance.
Tree species classification is a necessary and challenging task for both multimedia technique and forestry engineering. Continuous developments in multimedia and vision technology enable this task to be achieved faster and more conveniently without relying on professional equipment. However, encountering many species and high similarity tree species, the existing methods cannot be completed well. In this paper, we propose a non-invasive attention learning branch structure to strengthen the discrimination of CNNs for complex tree species classification. Firstly, we proposed a novel network, namely Attention Learning Convolutional Neural Network (ALNet), in which a non-invasive learning branch for leaf-image attention capture is added in the typical convolutional blocks to strengthen the discrimination capacity of CNN, especially in the challenging task of classification of tree species with high similarity leaves, leaves image features only slightly different. Secondly, we designed a novel Attention Learning Branch (ALB), which can capture and learn leaf image attention, and be non-invasively integrated into existing CNN pipelines to capture the Region of Interest (ROI) of input features for image classification processing. Thirdly, we adopted a reliable residual approach to complete the integration of network branches, which improved overall network performance. In the Leafsnap dataset, the ALNet test was performed with 91.76% accuracy on the classification of 185 species of trees, in which trees of different subspecies contained highly similar leaf images. Moreover, in the conventional Flavia dataset of tree leaf-image, our proposed ALNet also achieved satisfying results. The proposed method showed a remarkable improvement in the work done previously.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available