4.1 Article

Eth2Vec: Learning contract-wide code representations for vulnerability detection on Ethereum smart contracts

Journal

BLOCKCHAIN-RESEARCH AND APPLICATIONS
Volume 3, Issue 4, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.bcra.2022.100101

Keywords

Ethereum; Smart contracts; Blockchain; Neural networks; Static analysis; Code similarity; Vulnerability detection

Ask authors/readers for more resources

Ethereum smart contracts are computer programs executed on the Ethereum blockchain to enforce agreements among untrusting parties. In this paper, we propose Eth2Vec, a machine-learning-based static analysis tool that can detect vulnerabilities in smart contracts even in rewritten codes by comparing similarities with learned contracts. Eth2Vec outperforms a recent model based on support vector machine in terms of precision, recall, and F1-score.
Ethereum smart contracts are computer programs that are deployed and executed on the Ethereum blockchain to enforce agreements among untrusting parties. Being the most prominent platform that supports smart contracts, Ethereum has been targeted by many attacks and plagued by security incidents. Consequently, many smart contract vulnerabilities have been discovered in the past decade. To detect and prevent such vulnerabilities, different security analysis tools, including static and dynamic analysis tools, have been created, but their performance decreases drastically when codes to be analyzed are constantly being rewritten. In this paper, we propose Eth2Vec, a machine-learning-based static analysis tool that detects smart contract vulnerabilities. Eth2Vec maintains its robustness against code rewrites; i.e., it can detect vulnerabilities even in rewritten codes. Other machine-learning-based static analysis tools require features, which analysts create manually, as inputs. In contrast, Eth2Vec uses a neural network for language processing to automatically learn the features of vulnerable contracts. In doing so, Eth2Vec can detect vulnerabilities in smart contracts by comparing the similarities between the codes of a target contract and those of the learned contracts. We performed experiments with existing open databases, such as Etherscan, and Eth2Vec was able to outperform a recent model based on support vector machine in terms of well-known metrics, i.e., precision, recall, and F1-score.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.1
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available