☆ 4.6 Article

Multi-Scale Cost Attention and Adaptive Fusion Stereo Matching Network

ELECTRONICS (2023)

期刊

ELECTRONICS

卷 12, 期 7, 页码 -

出版社

MDPI

DOI: 10.3390/electronics12071594

关键词

cost attention; adaptive fusion; attention mechanism; stereo matching

类别

Computer Science, Information Systems Engineering, Electrical & Electronic Physics, Applied

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Compared to 3D convolution, 2D convolution is less computationally expensive and faster in stereo matching methods. However, the initial cost volume generated by 2D convolution lacks rich information, resulting in lower robustness and accuracy in the disparity map affected by illumination. To address this, the proposed MCAFNet utilizes multi-scale adaptive cost attention and adaptive fusion to enrich the cost volume. With the improvements, the model achieves better performance in terms of EPE and error matching rates on different datasets.

At present, compared to 3D convolution, 2D convolution is less computationally expensive and faster in stereo matching methods based on convolution. However, compared to the initial cost volume generated by calculation using a 3D convolution method, the initial cost volume generated by 2D convolution in the relevant layer lacks rich information, resulting in the area affected by illumination in the disparity map having a lower robustness and thus affecting its accuracy. Therefore, to address the lack of rich cost volume information in the 2D convolution method, this paper proposes a multi-scale adaptive cost attention and adaptive fusion stereo matching network (MCAFNet) based on AANet+. Firstly, the extracted features are used for initial cost calculation, and the cost volume is input into the multi-scale adaptive cost attention module to generate attention weight, which is then combined with the initial cost volume to suppress irrelevant information and enrich the cost volume. Secondly, the cost aggregation part of the model is improved. A multi-scale adaptive fusion module is added to improve the fusion efficiency of cross-scale cost aggregation. In the Scene Flow dataset, the EPE is reduced to 0.66. The error matching rates in the KITTI2012 and KITTI2015 datasets are 1.60% and 2.22%, respectively.

Multi-Scale Cost Attention and Adaptive Fusion Stereo Matching Network

期刊

ELECTRONICS

出版社

MDPI

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Multi-Scale Cost Attention and Adaptive Fusion Stereo Matching Network

期刊

ELECTRONICS

出版社

MDPI

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文