Journal
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
Volume 33, Issue 8, Pages 3400-3414Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TNNLS.2021.3052829
Keywords
Training; Knowledge engineering; Task analysis; Covariance matrices; Correlation; Complexity theory; Visualization; First-order networks; image classification; second-order pooling
Categories
Funding
- Hong Kong RGC GRF Project [PolyU 152135/16E]
- NSF of China [61971086]
Ask authors/readers for more resources
This study introduces a novel architecture that effectively utilizes second-order pooling while maintaining model complexity unchanged during inference. During training, auxiliary second-order pooling networks help the backbone first-order network learn more discriminative feature representations. After training, all auxiliary branches can be removed, and only the backbone first-order network is used for inference.
Second-order pooling has proved to be more effective than its first-order counterpart in visual classification tasks. However, second-order pooling suffers from the high demand for a computational resource, limiting its use in practical applications. In this work, we present a novel architecture, namely a detachable second-order pooling network, to leverage the advantage of second-order pooling by first-order networks while keeping the model complexity unchanged during inference. Specifically, we introduce second-order pooling at the end of a few auxiliary branches and plug them into different stages of a convolutional neural network. During the training stage, the auxiliary second-order pooling networks assist the backbone first-order network to learn more discriminative feature representations. When training is completed, all auxiliary branches can be removed, and only the backbone first-order network is used for inference. Experiments conducted on CIFAR-10, CIFAR-100, and ImageNet data sets clearly demonstrated the leading performance of our network, which achieves even higher accuracy than second-order networks but keeps the low inference complexity of first-order networks.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available