4.6 Article

Hourglass 3D CNN for Stereo Disparity Estimation for Mobile Robots

Journal

APPLIED SCIENCES-BASEL
Volume 13, Issue 19, Pages -

Publisher

MDPI
DOI: 10.3390/app131910677

Keywords

mobile robots; depth estimation; stereo camera

Ask authors/readers for more resources

This paper proposes a method to improve the accuracy of depth estimation by applying a deep encoder-decoder architecture to a 3D CNN. The proposed architecture achieves an optimal tradeoff between computational cost and accuracy.
Stereo cameras allow mobile robots to perceive depth in their surroundings by capturing two separate images from slightly different perspectives. This is necessary for tasks such as obstacle avoidance, navigation, and spatial mapping. By utilizing a convolutional neural network (CNN), existing works in stereo cameras based on depth estimation have achieved superior results. However, the critical requirement for depth estimation for mobile robots is to have an optimal tradeoff between computational cost and accuracy. To achieve such a tradeoff, attention-aware feature aggregation (AAFS) has been proposed for real-time stereo matching on edge devices. AAFS includes multistage feature extraction, an attention module, and a 3D CNN architecture. However, its 3D CNN architecture learns contextual information ineffectively. In this paper, a deep encoder-decoder architecture is applied to an AAFS 3D CNN to improve depth estimation accuracy. Through evaluation, it is proven that the proposed 3D CNN architecture provides significantly better accuracy while keeping the inference time comparable to that of AAFS.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available