4.7 Article

Event Detection on Twitter by Mapping Unexpected Changes in Streaming Data into a Spatiotemporal Lattice

Journal

IEEE TRANSACTIONS ON BIG DATA
Volume 8, Issue 2, Pages 508-522

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TBDATA.2019.2948594

Keywords

Twitter; Event detection; Feature extraction; Spatiotemporal phenomena; Lattices; Urban areas; Data mining; Hierarchical patterns; events detection; twitter stream

Funding

  1. National Health & Medical Research Council [APP1128968]

Ask authors/readers for more resources

This research proposes a new approach to event localization and ranking by modeling the use of language in tweets. It detects unexpected changes in language usage and identifies anomalies across cities, countries, and time periods.
Many applications seek to make sense of high volume streaming data from social media by identifying spatiotemporal patterns. Events, representing topics that emerge and decay over time, are detected by monitoring for changes in the language being used, but typical approaches do not consider the localisation of events in cities and countries, and within hours, days, and weeks. This work develops and evaluates a new approach to event localisation and ranking that can be applied to Twitter data streams. The proposed approach models the use of language in tweets per city per hour to produce a model that can be used to detect the magnitude of unexpected changes in the use of the language. The approach uses a spatiotemporal lattice structure and a method for traversing between hours, days, and weeks, as well as cities, regions, and countries to identify anomalies in the language used across millions of tweets. The output is a ranked list of events comprising a list of tweets posted within a location and period of time, and characterized by language features of interest. The approach was implemented and tested by comparing events detected across five example domains (suicide, shooting, elections, sports, and sentiment) using 11.7 million tweets from users located in 100 cities and posted within the 203-day study period. Experiments demonstrate that the approach can detect events across a range of application domains.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available