Convolutional Neural Networks Are Not Invariant to Translation, but They Can Learn to Be

JOURNAL OF MACHINE LEARNING RESEARCH (2021)

Journal

JOURNAL OF MACHINE LEARNING RESEARCH

Volume 22, Issue -, Pages -

Publisher

MICROTOME PUBL

Keywords

Equivariance; internal representation; convolutional neural networks; translation invariance

Categories

Automation & Control Systems; Computer Science, Artificial Intelligence

Summary

This study tested a range of CNN architectures and found that, apart from DenseNet-121, none of the tested models was architecturally invariant to translation, although all of them could learn this invariance. Invariance can be acquired by pretraining on ImageNet, and sometimes on much simpler datasets when all items are fully translated across the input canvas, but it can be disrupted by further training through catastrophic forgetting/interference.

Abstract

When seeing a new object, humans can immediately recognize it across different retinal locations: the internal object representation is invariant to translation. It is commonly believed that Convolutional Neural Networks (CNNs) are architecturally invariant to translation thanks to the convolution and/or pooling operations they are endowed with. In fact, several studies have found that these networks systematically fail to recognize new objects at untrained locations. In this work, we test a wide variety of CNN architectures, showing that, apart from DenseNet-121, none of the models tested was architecturally invariant to translation. Nevertheless, all of them could learn to be invariant to translation. We show how this can be achieved by pretraining on ImageNet, and that it is sometimes possible with much simpler data sets when all the items are fully translated across the input canvas. At the same time, this invariance can be disrupted by further training due to catastrophic forgetting/interference. These experiments show how pretraining a network on an environment with the right 'latent' characteristics (a more naturalistic environment) can result in the network learning deep perceptual rules which would dramatically improve subsequent generalization.
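The distinction the abstract draws between what convolution and pooling provide and what a full network delivers can be illustrated with a toy example. The sketch below is not the paper's experimental setup: it assumes a 1-D signal, circular (wrap-around) padding, and hand-picked kernel and readout weights, all of which are hypothetical. Under those assumptions it shows that convolution alone is equivariant to translation (the feature map shifts with the input), that a global max-pooling readout is invariant, and that a position-dependent dense readout over the same features is not.

```python
def conv1d_circular(signal, kernel):
    """Cross-correlate `signal` with `kernel` using circular (wrap-around) padding."""
    n = len(signal)
    return [
        sum(kernel[j] * signal[(i + j) % n] for j in range(len(kernel)))
        for i in range(n)
    ]


def shift(signal, k):
    """Circularly translate `signal` by k positions to the right."""
    k %= len(signal)
    return signal[-k:] + signal[:-k] if k else list(signal)


def global_max_readout(features):
    """Position-independent readout: global max pooling over the feature map."""
    return max(features)


def dense_readout(features, weights):
    """Position-dependent readout: one linear unit over the flattened feature map."""
    return sum(w * f for w, f in zip(weights, features))


# Hypothetical toy data: a small "object" placed on an otherwise empty canvas.
signal = [0, 0, 1, 3, 1, 0, 0, 0]
kernel = [1, 2, 1]
weights = [1, 2, 3, 4, 5, 6, 7, 8]  # assumed readout weights, one per position

features = conv1d_circular(signal, kernel)
shifted_features = conv1d_circular(shift(signal, 3), kernel)

# Equivariance: translating the input translates the feature map by the same amount.
equivariant = shifted_features == shift(features, 3)

# Invariance holds for the pooled readout but not for the dense readout.
pooled_invariant = global_max_readout(features) == global_max_readout(shifted_features)
dense_invariant = dense_readout(features, weights) == dense_readout(shifted_features, weights)

print(equivariant, pooled_invariant, dense_invariant)
```

In this toy setting only the readout decides whether equivariant features become an invariant output. Real architectures add further position-dependent elements (zero padding, strides, fully connected layers), so the sketch should be read as intuition for the architectural question, not as an account of the reported results.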
