期刊
APPLIED SCIENCES-BASEL
卷 13, 期 5, 页码 -出版社
MDPI
DOI: 10.3390/app13053025
关键词
location prediction; location extraction; Twitter; Arabic tweets; social media; computational linguistics; natural language processing; feature selection; machine learning
Twitter is a popular microblogging platform with millions of active users worldwide, generating millions of posts daily. Predicting the location of tweets is crucial for various reasons, but most users don't enable geotagging and their home locations are not standardized or accurate. This study applied machine learning techniques to predict tweet locations based on their textual content and achieved 67% accuracy in experiments with Arabic tweets from Saudi Arabia.
Twitter, one of the most popular microblogging platforms, has tens of millions of active users worldwide, generating hundreds of millions of posts every day. Twitter posts, referred to as tweets, the short and the noisy text, bring many challenges with them, such as in the case of some emergency or disaster. Predicting the location of these tweets is important for social, security, human rights, and business reasons and has raised noteworthy consideration lately. However, most Twitter users disable the geo-tagging feature, and their home locations are neither standardized nor accurate. In this study, we applied four machine learning techniques named Logistic Regression, Random Forest, Multinomial Naive Bayes, and Support Vector Machine with and without the utilization of the geo-distance matrix for location prediction of a tweet using its textual content. Our extensive experiments on our vast collection of Arabic tweets From Saudi Arabia with different feature sets yielded promising results with 67% accuracy.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据