4.6 Article

PDANet: Self-Supervised Monocular Depth Estimation Using Perceptual and Data Augmentation Consistency

期刊

APPLIED SCIENCES-BASEL
卷 11, 期 12, 页码 -

出版社

MDPI
DOI: 10.3390/app11125383

关键词

self-supervised; depth estimation; low texture; perceptual consistency; data augmentation

资金

  1. Shandong Provincial Natural Science Foundation, China [ZR2018MA032, ZR2020MA064]

向作者/读者索取更多资源

Recent studies have shown that traditional methods cannot accurately estimate the depth value of pixels, hence a network called PDANet is proposed to integrate perceptual consistency and data augmentation consistency as reliable unsupervised signals to improve the depth estimation model.
In recent studies, self-supervised learning methods have been explored for monocular depth estimation. They minimize the reconstruction loss of images instead of depth information as a supervised signal. However, existing methods usually assume that the corresponding points in different views should have the same color, which leads to unreliable unsupervised signals and ultimately damages the reconstruction loss during the training. Meanwhile, in the low texture region, it is unable to predict the disparity value of pixels correctly because of the small number of extracted features. To solve the above issues, we propose a network-PDANet-that integrates perceptual consistency and data augmentation consistency, which are more reliable unsupervised signals, into a regular unsupervised depth estimation model. Specifically, we apply a reliable data augmentation mechanism to minimize the loss of the disparity map generated by the original image and the augmented image, respectively, which will enhance the robustness of the image in the prediction of color fluctuation. At the same time, we aggregate the features of different layers extracted by a pre-trained VGG16 network to explore the higher-level perceptual differences between the input image and the generated one. Ablation studies demonstrate the effectiveness of each components, and PDANet shows high-quality depth estimation results on the KITTI benchmark, which optimizes the state-of-the-art method from 0.114 to 0.084, measured by absolute relative error for depth estimation.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据