4.7 Article

DAN: Deep-Attention Network for 3D Shape Recognition

期刊

IEEE TRANSACTIONS ON IMAGE PROCESSING
卷 30, 期 -, 页码 4371-4383

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TIP.2021.3071687

关键词

Shape; Three-dimensional displays; Task analysis; Feature extraction; Correlation; Computer architecture; Visualization; 3D shape; multiview; DAN; classification; retrieval

资金

  1. National Key Research and Development Program of China [2020YFB1711704]
  2. National Natural Science Foundation of China [61872267, 61702471, 61772359]
  3. Tianjin New Generation Artificial Intelligence Major Program [18ZXZNGX00150, 19ZXZNGX00110]
  4. Open Project Program of the State Key Laboratory of CAD and CG, Zhejiang University [A2005, A2012]
  5. Tianjin Science Foundation for Young Scientists of China [19JCQNJC00500]

向作者/读者索取更多资源

In this paper, a novel deep-attention network (DAN) based on multiview information for 3D shape representation is proposed. The attention mechanism is introduced to update feature vectors and fuse information effectively, addressing the challenges of exploring effective representation of 3D shapes and reducing redundant complexity. Experimental results on public 3D shape datasets demonstrate the superiority of the proposed method in comparison with state-of-the-art methods.
Due to the wide applications in a rapidly increasing number of different fields, 3D shape recognition has become a hot topic in the computer vision field. Many approaches have been proposed in recent years. However, there remain huge challenges in two aspects: exploring the effective representation of 3D shapes and reducing the redundant complexity of 3D shapes. In this paper, we propose a novel deep-attention network (DAN) for 3D shape representation based on multiview information. More specifically, we introduce the attention mechanism to construct a deep multiattention network that has advantages in two aspects: 1) information selection, in which DAN utilizes the self-attention mechanism to update the feature vector of each view, effectively reducing the redundant information, and 2) information fusion, in which DAN applies attention mechanism that can save more effective information by considering the correlations among views. Meanwhile, deep network structure can fully consider the correlations to continuously fuse effective information. To validate the effectiveness of our proposed method, we conduct experiments on the public 3D shape datasets: ModelNet40, ModelNet10, and ShapeNetCore55. Experimental results and comparison with state-of-the-art methods demonstrate the superiority of our proposed method. Code is released on https://github.com/RiDang/DANN.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据