☆ 4.7 Article

M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval

IEEE TRANSACTIONS ON MULTIMEDIA (2021)

期刊

IEEE TRANSACTIONS ON MULTIMEDIA

卷 23, 期 -, 页码 1962-1976

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TMM.2020.3006371

关键词

Three-dimensional displays; Solid modeling; Two dimensional displays; Computational modeling; Visualization; Feature extraction; Predictive models; Cross-domain retrieval; 3D model retrieval; multi-head attention; multiple graphs

类别

Computer Science, Information Systems Computer Science, Software Engineering Telecommunications

资金

National Natural Science Foundation of China [61772359, 61572356, 61872267, 61902277]
2019 Tianjin New Generation Artificial Intelligence Major Program [18ZXZNGX00150, 19ZXZNGX00110]
Open Project Program of the State Key Lab of CAD & CG, Zhejiang University [A2005, A2012]
Tianjin Science Foundation for Young Scientists of China [19JCQNJC00500]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The study introduces a novel approach for 3D model retrieval based on 2D images, utilizing a multi-branch graph convolution network and a multi-head attention mechanism to enhance the relationship between nodes and improve retrieval performance.

2D image based 3D model retrieval is a challenging research topic in the field of 3D model retrieval. The huge gap between two modalities - 2D image and 3D model, extremely constrains the retrieval performance. In order to handle this problem, we propose a novel multi-branch graph convolution network (M-GCN) to address the 2D image based 3D model retrieval problem. First, we compute the similarity between 2D image and 3D model based on visual information to construct one cross-modalities graph model, which can provide the original relationship between image and 3D model. However, this relationship is not accurate because of the difference of modalities. Thus, the multi-head attention mechanism is employed to generate a set of fully connected edge-weighted graphs, which can predict the hidden relationship between 2D image and 3D model to further strengthen the correlation for the embedding generation of nodes. Finally, we apply the max-pooling operation to fuse the multi-graphs information and generate the fusion embeddings of nodes for retrieval. To validate the performance of our method, we evaluated M-GCN on the MI3DOR dataset, Shrec 2018 track and Shrec 2014 track. The experimental results demonstrate the superiority of our proposed method over the state-of-the-art methods.

M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval

期刊

IEEE TRANSACTIONS ON MULTIMEDIA

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

M-GCN: Multi-Branch Graph Convolution Network for 2D Image-based on 3D Model Retrieval

期刊

IEEE TRANSACTIONS ON MULTIMEDIA

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文