4.3 Article

COVID-19: Detecting Government Pandemic Measures and Public Concerns from Twitter Arabic Data Using Distributed Machine Learning

Publisher

MDPI
DOI: 10.3390/ijerph18010282

Keywords

COVID-19; coronavirus; machine learning; big data; social media; apache spark; Twitter; Arabic language; distributed computing; smart cities; smart healthcare; smart governance; Triple Bottom Line (TBL)

Funding

  1. Deanship of Scientific Research (DSR) at King Abdulaziz University, Jeddah [RG-6-611-40]

Ask authors/readers for more resources

Today's societies are more interconnected than ever before, as highlighted by the vulnerabilities exposed during the COVID-19 pandemic. This study introduces a software tool utilizing machine learning to analyze Twitter data in Arabic, aiming to detect government measures and public concerns during the pandemic. The tool's effectiveness is demonstrated through the detection of significant events, government actions, and public sentiments in both temporal and spatial dimensions, showcasing the power of Twitter in real-time information detection.
Today's societies are connected to a level that has never been seen before. The COVID-19 pandemic has exposed the vulnerabilities of such an unprecedently connected world. As of 19 November 2020, over 56 million people have been infected with nearly 1.35 million deaths, and the numbers are growing. The state-of-the-art social media analytics for COVID-19-related studies to understand the various phenomena happening in our environment are limited and require many more studies. This paper proposes a software tool comprising a collection of unsupervised Latent Dirichlet Allocation (LDA) machine learning and other methods for the analysis of Twitter data in Arabic with the aim to detect government pandemic measures and public concerns during the COVID-19 pandemic. The tool is described in detail, including its architecture, five software components, and algorithms. Using the tool, we collect a dataset comprising 14 million tweets from the Kingdom of Saudi Arabia (KSA) for the period 1 February 2020 to 1 June 2020. We detect 15 government pandemic measures and public concerns and six macro-concerns (economic sustainability, social sustainability, etc.), and formulate their information-structural, temporal, and spatio-temporal relationships. For example, we are able to detect the timewise progression of events from the public discussions on COVID-19 cases in mid-March to the first curfew on 22 March, financial loan incentives on 22 March, the increased quarantine discussions during March-April, the discussions on the reduced mobility levels from 24 March onwards, the blood donation shortfall late March onwards, the government's 9 billion SAR (Saudi Riyal) salary incentives on 3 April, lifting the ban on five daily prayers in mosques on 26 May, and finally the return to normal government measures on 29 May 2020. These findings show the effectiveness of the Twitter media in detecting important events, government measures, public concerns, and other information in both time and space with no earlier knowledge about them.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available