Article

In situ training of feed-forward and recurrent convolutional memristor networks

Journal

NATURE MACHINE INTELLIGENCE
Volume 1, Issue 9, Pages 434-442

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s42256-019-0089-1

Keywords

-

Funding

  1. US Air Force Research Laboratory [FA8750-18-2-0122]
  2. Defense Advanced Research Projects Agency [D17PC00304]
  3. Beijing Advanced Innovation Center for Future Chip
  4. National Science Foundation of China [61674089, 61674092]

Abstract

The explosive growth of machine learning is largely due to recent advances in hardware and architecture. The engineering of network structures, taking advantage of the spatial or temporal translational invariance of patterns, naturally leads to bio-inspired, shared-weight structures such as convolutional neural networks, which have markedly reduced the number of free parameters. State-of-the-art microarchitectures commonly rely on weight-sharing techniques, but still suffer from the von Neumann bottleneck of transistor-based platforms. Here, we experimentally demonstrate the in situ training of a five-level convolutional neural network that self-adapts to non-idealities of the one-transistor one-memristor array to classify the MNIST dataset, achieving accuracy comparable to that of the memristor-based multilayer perceptron with a reduction in trainable parameters of approximately 75% owing to the shared weights. In addition, the memristors encoded both spatial and temporal translational invariance simultaneously in a convolutional long short-term memory (ConvLSTM) network, a memristor-based neural network with intrinsic 3D input processing, which was trained in situ to classify a synthetic MNIST sequence dataset using just 850 weights. These proof-of-principle demonstrations combine the architectural advantages of weight sharing with the area and energy efficiency boost of the memristors, paving the way for future edge artificial intelligence.
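
The approximately 75% saving quoted above comes from weight sharing: a convolutional layer learns one small kernel and reuses it at every spatial position, whereas a fully connected layer learns a separate weight for every input-output pair. The Python sketch below makes the arithmetic concrete with a back-of-the-envelope parameter count for 28 x 28 MNIST inputs; the layer sizes are illustrative assumptions, not the paper's actual five-level network, so the exact ratio differs from the reported figure.

    # Back-of-the-envelope parameter counts: why weight sharing shrinks a model.
    # Layer sizes are illustrative assumptions for 28x28 MNIST inputs, not the
    # five-level network reported in the paper.

    def dense_params(n_in: int, n_out: int) -> int:
        # Fully connected layer: one weight per input-output pair, plus biases.
        return n_in * n_out + n_out

    def conv_params(k: int, c_in: int, c_out: int) -> int:
        # Convolutional layer: one shared k-by-k kernel per channel pair,
        # reused at every spatial position, plus one bias per output channel.
        return k * k * c_in * c_out + c_out

    # Hypothetical multilayer perceptron: 784 -> 100 -> 10.
    mlp = dense_params(784, 100) + dense_params(100, 10)

    # Hypothetical small CNN: two 5x5 convolutions (pooling between them adds
    # no weights) followed by a dense classifier over 16 feature maps of 4x4.
    cnn = conv_params(5, 1, 8) + conv_params(5, 8, 16) + dense_params(16 * 4 * 4, 10)

    print(f"MLP trainable parameters: {mlp}")            # 79510
    print(f"CNN trainable parameters: {cnn}")            # 5994
    print(f"parameter reduction: {1 - cnn / mlp:.0%}")   # 92% for these sizes

Because every weight in a memristor crossbar occupies physical devices, fewer trainable parameters translates directly into a smaller array and lower programming overhead, which is why the shared-weight structure matters for the in situ training demonstrated here.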

