☆ 4.7 Article

Adaptive Viewpoint Feature Enhancement-Based Binocular Stereoscopic Image Saliency Detection

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY (2022)

期刊

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

卷 32, 期 10, 页码 6543-6556

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TCSVT.2022.3171563

关键词

Stereo image processing; Saliency detection; Feature extraction; Visualization; Neural networks; Image color analysis; Deep learning; Stereoscopic image; visual saliency; binocular vision

类别

Engineering, Electrical & Electronic

资金

National Natural Science Foundation of China [61871270, 62022002]
Shenzhen Natural Science Foundation [JCYJ20200109110410133, 20200812110350001]
Shenzhen Virtual University Park, The Science Technology and Innovation Committee of Shenzhen Municipality [2021Szvup128]
Hong Kong Research Grants Council General Research Fund (GRF) [11203220]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper proposes a visual saliency detection method for stereoscopic images based on adaptive viewpoint feature enhancement. By analyzing the correlation between left and right views and using attention-based saliency feature pyramid extraction, the proposed method improves the accuracy of saliency detection. Additionally, a stereoscopic image saliency dataset is created to facilitate further research in this field.

Modeling 3D visual saliency has received great attention due to the development of emerging 3D display technologies. Traditional methods relying on low-level features may not be efficient in interpreting 3D visual content from high-level semantic perspective. Despite numerous efforts dedicated to this area, existing 3D visual saliency detection methods do not necessarily excel in exploring the stereoscopic image saliency driven by the intra-view and inter-view dependencies among left and right views. In this paper, we propose a visual saliency detection method for stereoscopic images grounded on adaptive viewpoint feature enhancement via binocular vision. More specifically, the correlation among left and right views is investigated through a delicately designed binocular stereoscopic saliency feature aggregation module, enabling the generation of more representative saliency features towards binocular vision. Subsequently, to further aggregate the saliency features in multiple scales, we design a progressive attention-based saliency feature pyramid extraction module to effectively integrate the features from top-level to down-level based on the network hierarchy mechanism. The saliency maps are ultimately produced for stereoscopic images by evaluating the obtained saliency features. In addition, we create a stereoscopic image saliency dataset (SIS-3D) that includes 1086 stereoscopic image pairs with various content and their corresponding human eye fixation annotations, aiming to further facilitate the research on visual saliency detection for stereoscopic images. Extensive experiments demonstrate that our proposed method improves CC by an average of 4.02% compared to representative counterparts on the newly built saliency dataset and another publicly available dataset.

Adaptive Viewpoint Feature Enhancement-Based Binocular Stereoscopic Image Saliency Detection

期刊

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Adaptive Viewpoint Feature Enhancement-Based Binocular Stereoscopic Image Saliency Detection

期刊

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文