☆ 4.7 Article

When blockchain meets AI: Optimal mining strategy achieved by machine learning

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS (2021)

期刊

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS

卷 36, 期 5, 页码 2183-2207

出版社

WILEY

DOI: 10.1002/int.22375

关键词

blockchain; MDP; proof-of-work; reinforcement learning; selfish mining

类别

Computer Science, Artificial Intelligence

资金

National Key R&D Program of China [2018YFB2100705]
Natural Science Fund of Guangdong Province [2020A1515010708]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study applies reinforcement learning to derive an optimal Bitcoin-like blockchain mining strategy without requiring knowledge of the network model. By designing a new multidimensional RL algorithm, it achieves performance approaching the optimal mining strategy even in time-varying blockchain networks.

This study applies reinforcement learning (RL) from the AI machine learning field to derive an optimal Bitcoin-like blockchain mining strategy. A salient feature of the RL learning framework is that an optimal (or near-optimal) strategy can be obtained without knowing the details of the blockchain network model. Previously, the most profitable mining strategy was believed to be honest mining encoded in the default blockchain protocol. It was shown later that it is possible to gain more mining rewards by deviating from honest mining. In particular, the mining problem can be formulated as a Markov Decision Process (MDP) which can be solved to give the optimal mining strategy. However, solving the mining MDP requires knowing the values of various parameters that characterize the blockchain network model. In real blockchain networks, these parameter values are not easy to obtain and may change over time. This hinders the use of the MDP model-based solution. In this study, we employ RL to dynamically learn a mining strategy with performance approaching that of the optimal mining strategy. Since the mining MDP problem has a nonlinear objective function (rather than linear functions of standard MDP problems), we design a new multidimensional RL algorithm to solve the problem. Experimental results indicate that, without knowing the parameter values of the mining MDP model, our multidimensional RL mining algorithm can still achieve optimal performance over time-varying blockchain networks.

When blockchain meets AI: Optimal mining strategy achieved by machine learning

期刊

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS

出版社

WILEY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

When blockchain meets AI: Optimal mining strategy achieved by machine learning

期刊

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS

出版社

WILEY

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文