4.7 Article

Deep learning for conifer/deciduous classification of airborne LiDAR 3D point clouds representing individual trees

Journal

ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING
Volume 158, Issue -, Pages 219-230

Publisher

ELSEVIER
DOI: 10.1016/j.isprsjprs.2019.10.011

Keywords

Remote sensing; Convolutional neural network; Representation engineering; Unbalanced training data; Mislabel correction

Funding

  1. Department of Forestry at the University of Kentucky [KY009026, 1001477]
  2. McIntire-Stennis project [KY009026, 1001477]
  3. Kentucky Science and Engineering Foundation [KSEF-3405-RDE-018]
  4. University of Kentucky Centre for Computational Sciences

Ask authors/readers for more resources

The purpose of this study was to investigate the use of deep learning for coniferous/deciduous classification of individual trees segmented from airborne LiDAR data. To enable processing by a deep convolutional neural network (CNN), we designed two discrete representations using leaf-off and leaf-on LiDAR data: a digital surface model with four channels (DSM x 4) and a set of four 2D views (4 x 2D). A training dataset of tree crowns was generated via segmentation of tree crowns, followed by co-registration with field data. Potential mislabels due to GPS error or tree leaning were corrected using a statistical ensemble filtering procedure. Because the training data was heavily unbalanced (similar to 8% conifers), we trained an ensemble of CNNs on random balanced sub-samples. Benchmarked against multiple traditional shallow learning methods using manually designed features, the CNNs improved accuracies up to 14%. The 4 x 2D representation yielded similar classification accuracies to the DSM x 4 representation (similar to 82% coniferous and similar to 90% deciduous) while converging faster. Further experimentation showed that early/late fusion of the channels in the representations did not affect the accuracies in a significant way. The data augmentation that was used for the CNN training improved the classification accuracies, but more real training instances (especially coniferous) likely results in much stronger improvements. Leaf-off LiDAR data were the primary source of useful information, which is likely due to the perennial nature of coniferous foliage. LiDAR intensity values also proved to be useful, but normalization yielded no significant improvement. As we observed, large training data may compensate for the lack of a subset of important domain data. Lastly, the classification accuracies of overstory trees (similar to 90%) were more balanced than those of understory trees (similar to 90% deciduous and similar to 65% coniferous), which is likely due to the incomplete capture of understory tree crowns via airborne LiDAR. In domains like remote sensing and biomedical imaging, where the data contain a large amount of information and are not friendly to human visual system, human-designed features may become suboptimal. As exemplified by this study, automatic, objective derivation of optimal features via deep learning can improve prediction tasks in such domains.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available