4.7 Article

RL-DARTS: Differentiable neural architecture search via reinforcement-learning-based meta-optimizer

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 234, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.knosys.2021.107585

Keywords

Neural architecture search; Reinforcement learning; Meta optimizer; Deep learning; Bi-level optimization

Ask authors/readers for more resources

This study introduces a novel differentiable search method based on reinforcement learning to optimize the architecture-parameter optimization problem in neural architecture search, aiming to improve computation efficiency, network precision, and robustness. By utilizing a double-loop algorithm to address the optimization problem in the searched super-network, the method alternates between optimizing the super-network and the meta-optimizer, leading to faster and more robust convergence.
Differentiable search approaches have attracted extensive attention recently due to their advantages in effectively finding novel neural architectures. However, these methods suffer from shortcomings on heavy computation consumption and low robustness in some cases. In this work, we propose a novel differentiable search method based on reinforcement learning, to further improve the computation efficiency, network precision, and robustness in the neural architecture search area. Our method constructs a reinforcement learning-based meta-optimizer to solve the architecture-parameter optimization problem, which is superior in properties of adaptability and robustness to fixed optimizers. This learnable meta-optimizer can alter its model parameters along with the search process to adapt the optimization procedure, making it possible to find out better structures and parameters with less time. Specifically, we formulate a double-loop algorithm to address the optimization problem in the searched super-network. Through switching between the external and internal loops, our method alternately optimizes the super-network and the meta-optimizer, which converges to the optimal location more rapidly and robustly. (C) 2021 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available