4.1 Article

Eth2Vec: Learning contract-wide code representations for vulnerability detection on Ethereum smart contracts

期刊

出版社

ELSEVIER
DOI: 10.1016/j.bcra.2022.100101

关键词

Ethereum; Smart contracts; Blockchain; Neural networks; Static analysis; Code similarity; Vulnerability detection

向作者/读者索取更多资源

Ethereum smart contracts are computer programs executed on the Ethereum blockchain to enforce agreements among untrusting parties. In this paper, we propose Eth2Vec, a machine-learning-based static analysis tool that can detect vulnerabilities in smart contracts even in rewritten codes by comparing similarities with learned contracts. Eth2Vec outperforms a recent model based on support vector machine in terms of precision, recall, and F1-score.
Ethereum smart contracts are computer programs that are deployed and executed on the Ethereum blockchain to enforce agreements among untrusting parties. Being the most prominent platform that supports smart contracts, Ethereum has been targeted by many attacks and plagued by security incidents. Consequently, many smart contract vulnerabilities have been discovered in the past decade. To detect and prevent such vulnerabilities, different security analysis tools, including static and dynamic analysis tools, have been created, but their performance decreases drastically when codes to be analyzed are constantly being rewritten. In this paper, we propose Eth2Vec, a machine-learning-based static analysis tool that detects smart contract vulnerabilities. Eth2Vec maintains its robustness against code rewrites; i.e., it can detect vulnerabilities even in rewritten codes. Other machine-learning-based static analysis tools require features, which analysts create manually, as inputs. In contrast, Eth2Vec uses a neural network for language processing to automatically learn the features of vulnerable contracts. In doing so, Eth2Vec can detect vulnerabilities in smart contracts by comparing the similarities between the codes of a target contract and those of the learned contracts. We performed experiments with existing open databases, such as Etherscan, and Eth2Vec was able to outperform a recent model based on support vector machine in terms of well-known metrics, i.e., precision, recall, and F1-score.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.1
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据