4.8 Article

Partial Connection Based on Channel Attention for Differentiable Neural Architecture Search

期刊

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS
卷 19, 期 5, 页码 6804-6813

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TII.2022.3184700

关键词

Channel attention; image classification; neural architecture search; partial connection

向作者/读者索取更多资源

This article proposes ADARTS, a differentiable neural architecture search method based on channel attention. By selecting channels with higher weights, important feature information is transmitted into the search space, leading to improved search efficiency and memory utilization, and avoiding the instability caused by random selection.
Differentiable neural architecture search (DARTS), as a gradient-guided search method, greatly reduces the cost of computation and speeds up the search. In DARTS, the architecture parameters are introduced to the candidate operations, but the parameters of some weight-equipped operations may not be trained well in the initial stage, which causes unfair competition between candidate operations. The weight-free operations appear in large numbers, which results in the phenomenon of performance crash. Besides, a lot of memory will be occupied during training supernet, which causes the memory utilization to be low. In this article, a partial channel connection based on channel attention for differentiable neural architecture search (ADARTS) is proposed. Some channels with higher weights are selected through the attention mechanism and sent into the operation space while the other channels are directly contacted with the processed channels. Selecting a few channels with higher attention weights can better transmit important feature information into the search space and greatly improve search efficiency and memory utilization. The instability of network structure caused by random selection can also be avoided. The experimental results show that ADARTS achieved 2.46% and 17.06% classification error rates on CIFAR-10 and CIFAR-100, respectively. ADARTS can effectively solve the problem that too many skip connections appear in the search process and obtain network structures with better performance.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据