☆ 3.8 Proceedings Paper

Equivariance versus Augmentation for Spherical Images

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162 (2022)

期刊

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162

卷 -, 期 -, 页码 -

出版社

JMLR-JOURNAL MACHINE LEARNING RESEARCH

关键词

类别

Computer Science, Artificial Intelligence

资金

Wallenberg AI, Autonomous Systems and Software Program (WASP) - Knut and Alice Wallenberg Foundation
Swedish Research Council
German Ministry for Education and Research (BMBF) [01IS14013A-E, 01GQ1115, 1GQ0850, 01IS18025A, 01IS18037A]
Swedish Research Council [2018-05973]
Knut and Alice Wallenberg Foundation

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This paper analyzes the role of rotational equivariance in convolutional neural networks applied to spherical images and compares the performance of group equivariant networks and standard non-equivariant networks. The study finds that, for the task of image classification, standard CNNs can achieve the same performance as equivariant networks with increased data augmentation and network size. However, for the task of semantic segmentation, equivariant networks consistently outperform non-equivariant networks with fewer parameters.

We analyze the role of rotational equivariance in convolutional neural networks (CNNs) applied to spherical images. We compare the performance of the group equivariant networks known as S2CNNs and standard non-equivariant CNNs trained with an increasing amount of data augmentation. The chosen architectures can be considered baseline references for the respective design paradigms. Our models are trained and evaluated on single or multiple items from the MNIST or FashionMNIST dataset projected onto the sphere. For the task of image classification, which is inherently rotationally invariant, we find that by considerably increasing the amount of data augmentation and the size of the networks, it is possible for the standard CNNs to reach at least the same performance as the equivariant network. In contrast, for the inherently equivariant task of semantic segmentation, the non-equivariant networks are consistently outperformed by the equivariant networks with significantly fewer parameters. We also analyze and compare the inference latency and training times of the different networks, enabling detailed tradeoff considerations between equivariant architectures and data augmentation for practical problems. The equivariant spherical networks used in the experiments are available at https://github. com/JanEGerken/sem_seg_s2cnn.

Equivariance versus Augmentation for Spherical Images

期刊

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162

出版社

JMLR-JOURNAL MACHINE LEARNING RESEARCH

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Equivariance versus Augmentation for Spherical Images

期刊

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162

出版社

JMLR-JOURNAL MACHINE LEARNING RESEARCH

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文