4.6 Article

AMSASeg: An Attention-Based Multi-Scale Atrous Convolutional Neural Network for Real-Time Object Segmentation From 3D Point Cloud

Journal

IEEE ACCESS
Volume 9, Issue -, Pages 70789-70796

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2021.3078371

Keywords

Three-dimensional displays; Feature extraction; Image segmentation; Real-time systems; Object segmentation; Computational efficiency; Task analysis; Deep learning; convolutional neural network; object segmentation; 3D point cloud; autonomous vehicles

Funding

  1. Ministry of Oceans and Fisheries, South Korea through the Project Development of Automatic Identification Monitoring System for Fishing Gears

Ask authors/readers for more resources

A novel attention-based multi-scale atrous convolutional neural network (AMSASeg) is proposed for object segmentation from 3D point cloud, aiming to effectively extract features of objects in different scales and shapes to improve segmentation accuracy. The network demonstrates the capability to enhance segmentation performance on objects of varying sizes at real-time speed.
Extracting meaningful information on objects varying scale and shape is a challenging task while obtaining distinctive features on small to large size objects to enhance overall object segmentation accuracy from 3D point cloud. To handle this challenge, we propose an attention-based multi-scale atrous convolutional neural network (AMSASeg) for object segmentation from 3D point cloud. Specifically, a backbone network consists of three modules: distinctive atrous spatial pyramid pooling (DASPP), FireModule, and FireDeconv. The DASPP utilizes average pooling operations and atrous convolutions with different sizes to aggregate distinctive information on objects at multiple scales. The FireModule and FireDeconv are responsible to efficiently extract general features. Meanwhile, a spatial attention module (SAM) and channel attention module (CAM) aggregate spatial and semantic information on smaller objects from low-level and high-level layers, respectively. Our network enables to encode multi-scale information and extract distinct feature on overall objects to enhance segmentation performance. We evaluate our method on KITTI dataset. Experimental results demonstrate that the proposed network is effective to improve segmentation performance on small to large objects at real-time speed.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available