☆ 4.8 Article

Solving the Rubik's cube with deep reinforcement learning and search

NATURE MACHINE INTELLIGENCE (2019)

期刊

NATURE MACHINE INTELLIGENCE

卷 1, 期 8, 页码 356-363

出版社

SPRINGERNATURE

DOI: 10.1038/s42256-019-0070-z

关键词

类别

Computer Science, Artificial Intelligence Computer Science, Interdisciplinary Applications

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

The Rubik's cube is a prototypical combinatorial puzzle that has a large state space with a single goal state. The goal state is unlikely to be accessed using sequences of randomly generated moves, posing unique challenges for machine learning. We solve the Rubik's cube with DeepCubeA, a deep reinforcement learning approach that learns how to solve increasingly difficult states in reverse from the goal state without any specific domain knowledge. DeepCubeA solves 100% of all test configurations, finding a shortest path to the goal state 60.3% of the time. DeepCubeA generalizes to other combinatorial puzzles and is able to solve the 15 puzzle, 24 puzzle, 35 puzzle, 48 puzzle, Lights Out and Sokoban, finding a shortest path in the majority of verifiable cases. For some combinatorial puzzles, solutions can be verified to be optimal, for others, the state space is too large to be certain that a solution is optimal. A new deep learning based search heuristic performs well on the iconic Rubik's cube and can also generalize to puzzles in which optimal solvers are intractable.

Solving the Rubik's cube with deep reinforcement learning and search

期刊

NATURE MACHINE INTELLIGENCE

出版社

SPRINGERNATURE

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Solving the Rubik's cube with deep reinforcement learning and search

期刊

NATURE MACHINE INTELLIGENCE

出版社

SPRINGERNATURE

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文