4.7 Article

Semantic Segmentation of Terrestrial Laser Scanning Point Clouds Using Locally Enhanced Image-Based Geometric Representations

Journal

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TGRS.2022.3161982

Keywords

Deep learning; point cloud; semantic segmentation; terrestrial laser scanning (TLS); transfer learning

Funding

  1. XJTLU Key Program Special Fund [KSF-E-40]
  2. XJTLU Research Development Fund [RDF-18-01-40]
  3. XJTLU Research Enhancement Funding [REF-21-01-003]

Ask authors/readers for more resources

This research proposes a novel image enhancement method to reveal the local geometric characteristics of point cloud data in images. The method explores various feature channel combinations and achieves improved semantic segmentation accuracy. Experimental results on the Semantic3D benchmark demonstrate the superiority of this image-based approach.
Point cloud data acquired using terrestrial laser scanning (TLS) often need to be semantically segmented to support many applications. To this end, various point-, voxel-, and image-based methods have been developed. For large-scale point cloud data, the former two types of methods often require extensive computational effort. In contrast, image-based methods are favorable from the perspective of computational efficiency. However, existing image-based methods are highly dependent on RGB information and do not provide an effective means of representing and utilizing the local geometric characteristics of point cloud data in images. This not only limits the overall segmentation accuracy but also prohibits their application to situations where the RGB information is absent. To overcome such issues, this research proposes a novel image enhancement method to reveal the local geometric characteristics in images derived by the projection of the point cloud coordinates. Based on this method, various feature channel combinations were investigated experimentally. It was found that the new combination I Z(e)D(e) (i.e., intensity, enhanced Z-coordinate, and enhanced range images) outperformed the conventional IRGB and IRGBD channel combinations. As such, the approach can be used to replace the RGB channels for semantic segmentation. Using this new combination and the pretrained HR-EHNet considered, a mean intersection over union (mIoU) of 74.2% and an overall accuracy (OA) of 92.1% were achieved on the Semantic3D benchmark, which sets a new state of the art (SOTA) for the semantic segmentation accuracy of image-based methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available