4.7 Article

Learning Multi-View Representation With LSTM for 3-D Shape Recognition and Retrieval

期刊

IEEE TRANSACTIONS ON MULTIMEDIA
卷 21, 期 5, 页码 1169-1182

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TMM.2018.2875512

关键词

3-D shape; multi-view; object recognition; object retrieval; CNN; LSTM

资金

  1. National Natural Science Foundation of China [61602499, 61471371, 61401474]
  2. Hunan Provincial Natural Science Foundation [2016JJ3025]
  3. National Postdoctoral Program for Innovative Talents [BX201600172]
  4. China Postdoctoral Science Foundation
  5. Fundamental Research Funds for the Central Universities

向作者/读者索取更多资源

Shape representation for 3-D models is an important topic in computer vision, multimedia analysis, and computer graphics. Recent multiview-based methods demonstrate promising performance for 3-D shape recognition and retrieval. However, most multiview-based methods ignore the correlations of multiple views or suffer from high computional cost. In this paper, we propose a novel multiview-based network architecture for 3-D shape recognition and retrieval. Our network combines convolutional neural networks (CNNs) with long short-term memory (LSTM) to exploit the correlative information from multiple views. Well-pretrained CNNs with residual connections are first used to extract a low-level feature of each view image rendered from a 3-D shape. Then, a LSTM and a sequence voting layer are employed to aggregate these features into a shape descriptor. The highway network and a three-step training strategy are also adopted to boost the optimization of the deep network. Experimental results on two public datasets demonstrate that the proposed method achieves promising performance for 3-D shape recognition and the state-of-the-art performance for the 3-D shape retrieval.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据