4.6 Article

Multi-hierarchy feature extraction and multi-step cost aggregation for stereo matching

Journal

NEUROCOMPUTING
Volume 492, Issue -, Pages 601-611

Publisher

ELSEVIER
DOI: 10.1016/j.neucom.2021.12.052

Keywords

Stereo matching; Disparity estimation; Multi-hierarchy feature; Multi-step cost aggregation

Funding

  1. R&D Program of Beijing Municipal Education Commission [KJZD20191000402]
  2. National Nature Science Foundation of China [51827813, 61472029]

Ask authors/readers for more resources

Compared with traditional hand-crafted feature based methods, learning-based stereo matching methods have made significant progress in matching accuracy. However, current CNN-based methods often require a substantial amount of time and memory consumption. To address this issue, we propose an accurate and fast stereo matching network that incorporates multi-hierarchy feature extraction and multi-step cost aggregation. Experimental results demonstrate that our network achieves highly competitive disparity estimation accuracy with fast inference speed.
Compared with the traditional hand-crafted feature based methods, learning-based stereo matching methods have made great progress in matching accuracy. However, current CNN-based stereo matching methods usually require a lot of time and memory consumption. It is very difficult to achieve the good balance between disparity estimation accuracy and inference speed that is significant to the application in real scenarios. To this end, we propose a accurate and fast stereo matching network (named MMNet), which contains two key modules of Multi-hierarchy feature extraction and Multi-step cost aggregation. In order to achieve a good trade-off between better disparity estimation and faster inference speed, a lightweight multi-hierarchy feature extractor is first proposed. This module obtains reliable feature information of different scales through three stable scale hierarchy branches, and outputs multi-step feature flows containing multi-scale fusion information at each step of the highest scale hierarchy branch. Moreover, we also propose a multi-step cost aggregation scheme, which uses shallow features to guide cost aggregation for ensuring a better aggregation effect with a small number of 3D convolutions. The experimental results on SceneFlow, KITTI 2012 and KITTI 2015 datasets show that our proposed network achieves extremely competitive disparity estimation accuracy with fast inference speed. (C) 2021 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available