☆ 4.7 Article

Learning Multi-View Representation With LSTM for 3-D Shape Recognition and Retrieval

IEEE TRANSACTIONS ON MULTIMEDIA (2019)

期刊

IEEE TRANSACTIONS ON MULTIMEDIA

卷 21, 期 5, 页码 1169-1182

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TMM.2018.2875512

关键词

3-D shape; multi-view; object recognition; object retrieval; CNN; LSTM

类别

Computer Science, Information Systems Computer Science, Software Engineering Telecommunications

资金

National Natural Science Foundation of China [61602499, 61471371, 61401474]
Hunan Provincial Natural Science Foundation [2016JJ3025]
National Postdoctoral Program for Innovative Talents [BX201600172]
China Postdoctoral Science Foundation
Fundamental Research Funds for the Central Universities

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Shape representation for 3-D models is an important topic in computer vision, multimedia analysis, and computer graphics. Recent multiview-based methods demonstrate promising performance for 3-D shape recognition and retrieval. However, most multiview-based methods ignore the correlations of multiple views or suffer from high computional cost. In this paper, we propose a novel multiview-based network architecture for 3-D shape recognition and retrieval. Our network combines convolutional neural networks (CNNs) with long short-term memory (LSTM) to exploit the correlative information from multiple views. Well-pretrained CNNs with residual connections are first used to extract a low-level feature of each view image rendered from a 3-D shape. Then, a LSTM and a sequence voting layer are employed to aggregate these features into a shape descriptor. The highway network and a three-step training strategy are also adopted to boost the optimization of the deep network. Experimental results on two public datasets demonstrate that the proposed method achieves promising performance for 3-D shape recognition and the state-of-the-art performance for the 3-D shape retrieval.

Learning Multi-View Representation With LSTM for 3-D Shape Recognition and Retrieval

期刊

IEEE TRANSACTIONS ON MULTIMEDIA

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Learning Multi-View Representation With LSTM for 3-D Shape Recognition and Retrieval

期刊

IEEE TRANSACTIONS ON MULTIMEDIA

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文