Article

Efficient detection of hacker community based on twitter data using complex networks and machine learning algorithm

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS (2021)

Journal

JOURNAL OF INTELLIGENT & FUZZY SYSTEMS

Volume 40, Issue 6, Pages 12321-12337

Publisher

IOS PRESS

DOI: 10.3233/JIFS-210458

Keywords

Tweets; hacking; prediction; twitter; social networks

Category

Computer Science, Artificial Intelligence

Abstract

This research aims to enhance the efficiency of hacker detection on the Twitter platform using complex network techniques and machine learning algorithms. By collecting and classifying hackers' tweets, it builds a dataset containing real users from a hacker community. Using several machine learning algorithms, the research detects hackers accurately and predicts the risk posed by tweets.

Twitter is one of the most popular platforms for sharing and posting ideas. Hackers and anonymous attackers use such platforms maliciously, and their behavior can be used to predict the risk of future attacks by gathering and classifying hackers' tweets with machine learning techniques. Previous approaches for detecting infected tweets rely on human effort or text analysis, so they are limited in their ability to capture the meaning hidden between the lines of a tweet. The main aim of this paper is to enhance the efficiency of hacker detection on the Twitter platform by combining complex network techniques with adapted machine learning algorithms. This work presents a methodology that collects a list of users from a hacker community on Twitter, together with the followers who share their posts and have similar interests. The list is built from a set of suggested keywords, which are terms commonly used by hackers in their tweets. Next, a complex network is generated for all users to find the relations among them in terms of network centrality, closeness, and betweenness. From these values, a dataset of the most influential users in the hacker community is assembled. Subsequently, tweets belonging to users in the extracted dataset are gathered and classified into positive and negative classes. The output of this process is then fed to a machine learning stage that applies different algorithms. This research builds and investigates an accurate dataset containing real users who belong to a hacker community. Correctly classified instances were measured using the average values of the K-nearest neighbor, Naive Bayes, Random Tree, and support vector machine techniques, demonstrating about 90% and 88% accuracy for cross-validation and percentage split, respectively. Consequently, the proposed network cyber Twitter model is able to detect hackers and to determine whether tweets pose a future risk to institutions and individuals, providing early warning of possible attacks.
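
The network step described in the abstract (relating users by centrality, closeness, and betweenness, then keeping the most influential ones) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy edge list, the account names, the undirected-graph simplification, and the choice to rank by betweenness alone are all assumptions made for illustration.

```python
from collections import deque

# Hypothetical toy graph (assumption): an undirected edge links two accounts
# where one follows or reshares the other. Names are invented.
EDGES = [("a", "b"), ("a", "c"), ("b", "c"), ("c", "d"), ("d", "e")]


def build_adjacency(edges):
    """Turn an edge list into an undirected adjacency mapping."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def degree_centrality(adj):
    """Fraction of the other nodes each node is directly connected to."""
    n = len(adj)
    return {v: len(nbrs) / (n - 1) for v, nbrs in adj.items()}


def closeness_centrality(adj):
    """(reachable nodes - 1) / sum of shortest-path distances from the node."""
    result = {}
    for source in adj:
        dist = {source: 0}
        queue = deque([source])
        while queue:  # breadth-first search gives unweighted distances
            u = queue.popleft()
            for w in adj[u]:
                if w not in dist:
                    dist[w] = dist[u] + 1
                    queue.append(w)
        total = sum(dist.values())
        result[source] = (len(dist) - 1) / total if total > 0 else 0.0
    return result


def betweenness_centrality(adj):
    """Brandes' algorithm, unweighted and undirected, without normalization."""
    bc = {v: 0.0 for v in adj}
    for s in adj:
        stack = []
        preds = {v: [] for v in adj}
        sigma = {v: 0 for v in adj}  # number of shortest paths from s
        sigma[s] = 1
        dist = {s: 0}
        queue = deque([s])
        while queue:
            u = queue.popleft()
            stack.append(u)
            for w in adj[u]:
                if w not in dist:
                    dist[w] = dist[u] + 1
                    queue.append(w)
                if dist[w] == dist[u] + 1:
                    sigma[w] += sigma[u]
                    preds[w].append(u)
        delta = {v: 0.0 for v in adj}
        while stack:  # accumulate dependencies in reverse BFS order
            w = stack.pop()
            for u in preds[w]:
                delta[u] += sigma[u] / sigma[w] * (1 + delta[w])
            if w != s:
                bc[w] += delta[w]
    # Each unordered pair was counted once from each endpoint.
    return {v: score / 2 for v, score in bc.items()}


def most_influential(adj, k):
    """Top-k nodes by betweenness (one possible ranking, chosen for brevity)."""
    bc = betweenness_centrality(adj)
    return sorted(bc, key=bc.get, reverse=True)[:k]


adj = build_adjacency(EDGES)
print(degree_centrality(adj))
print(closeness_centrality(adj))
print(most_influential(adj, 2))
```

In this toy graph, account `c` bridges the triangle `a`, `b`, `c` and the chain `d`, `e`, so it scores highest on all three measures. A real pipeline would build the edge list from collected follower data and would likely combine several measures rather than rank by one.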
