4.6 Article

Multi-Scale Cost Attention and Adaptive Fusion Stereo Matching Network

期刊

ELECTRONICS
卷 12, 期 7, 页码 -

出版社

MDPI
DOI: 10.3390/electronics12071594

关键词

cost attention; adaptive fusion; attention mechanism; stereo matching

向作者/读者索取更多资源

Compared to 3D convolution, 2D convolution is less computationally expensive and faster in stereo matching methods. However, the initial cost volume generated by 2D convolution lacks rich information, resulting in lower robustness and accuracy in the disparity map affected by illumination. To address this, the proposed MCAFNet utilizes multi-scale adaptive cost attention and adaptive fusion to enrich the cost volume. With the improvements, the model achieves better performance in terms of EPE and error matching rates on different datasets.
At present, compared to 3D convolution, 2D convolution is less computationally expensive and faster in stereo matching methods based on convolution. However, compared to the initial cost volume generated by calculation using a 3D convolution method, the initial cost volume generated by 2D convolution in the relevant layer lacks rich information, resulting in the area affected by illumination in the disparity map having a lower robustness and thus affecting its accuracy. Therefore, to address the lack of rich cost volume information in the 2D convolution method, this paper proposes a multi-scale adaptive cost attention and adaptive fusion stereo matching network (MCAFNet) based on AANet+. Firstly, the extracted features are used for initial cost calculation, and the cost volume is input into the multi-scale adaptive cost attention module to generate attention weight, which is then combined with the initial cost volume to suppress irrelevant information and enrich the cost volume. Secondly, the cost aggregation part of the model is improved. A multi-scale adaptive fusion module is added to improve the fusion efficiency of cross-scale cost aggregation. In the Scene Flow dataset, the EPE is reduced to 0.66. The error matching rates in the KITTI2012 and KITTI2015 datasets are 1.60% and 2.22%, respectively.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据