☆ 4.7 Article

2D-3D Geometric Fusion network using Multi-Neighbourhood Graph Convolution for RGB-D indoor scene classification

INFORMATION FUSION (2021)

期刊

INFORMATION FUSION

卷 76, 期 -, 页码 46-54

出版社

ELSEVIER

DOI: 10.1016/j.inffus.2021.05.002

关键词

Convolutional Graph Neural Network; Multi-modal fusion; Multi-Neighbourhood Graph Neural Network; Indoor scene classification; RGB-D

类别

Computer Science, Artificial Intelligence Computer Science, Theory & Methods

资金

Secretary of Universities and Research of the Generalitat de Catalunya
European Social Fund [FI2018, TEC2016-75976-R]
Ministerio de Economia, Industria y Competitividad
European Regional Development Fund (ERDF)

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper presents a 2D-3D Fusion stage that combines 3D Geometric Features with 2D Texture Features to achieve a more robust geometric embedding, outperforming the current state-of-the-art in RGB-D indoor scene classification task based on experimental results using NYU-Depth-V2 and SUN RGB-D datasets.

Multi-modal fusion has been proved to help enhance the performance of scene classification tasks. This paper presents a 2D-3D Fusion stage that combines 3D Geometric Features with 2D Texture Features obtained by 2D Convolutional Neural Networks. To get a robust 3D Geometric embedding, a network that uses two novel layers is proposed. The first layer, Multi-Neighbourhood Graph Convolution, aims to learn a more robust geometric descriptor of the scene combining two different neighbourhoods: one in the Euclidean space and the other in the Feature space. The second proposed layer, Nearest Voxel Pooling, improves the performance of the well-known Voxel Pooling. Experimental results, using NYU-Depth-V2 and SUN RGB-D datasets, show that the proposed method outperforms the current state-of-the-art in RGB-D indoor scene classification task.

2D-3D Geometric Fusion network using Multi-Neighbourhood Graph Convolution for RGB-D indoor scene classification

期刊

INFORMATION FUSION

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

2D-3D Geometric Fusion network using Multi-Neighbourhood Graph Convolution for RGB-D indoor scene classification

期刊

INFORMATION FUSION

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文