☆ 4.6 Article

Drifted Twitter Spam Classification Using Multiscale Detection Test on K-L Divergence

IEEE ACCESS (2019)

Journal

IEEE ACCESS

Volume 7, Issue -, Pages 108384-108394

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/ACCESS.2019.2932018

Keywords

Concept drift; drift detection test; twitter spam classification; K-L divergence

Funding

National Natural Science Foundation of China [51775385, 61703279, 71371142]
Fundamental Research Funds for the Central Universities

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Twitter spam classification is a tough challenge for social media platforms and cyber security companies. Twitter spam with illegal links may evolve over time in order to deceive filtering models, causing disastrous loss to both users and the whole network. We define this distributional evolution as a concept drift scenario. To build an effective model, we adopt K-L divergence to represent spam distribution and use a multiscale drift detection test (MDDT) to localize possible drifts therein. A base classifier is then retrained based on the detection result to gain performance improvement. Comprehensive experiments show that K-L divergence has highly consistent change patterns between features when a drift occurs. Also, the MDDT is proved to be effective in improving final classification result in both accuracy, recall, and f-measure.

Drifted Twitter Spam Classification Using Multiscale Detection Test on K-L Divergence

Journal

IEEE ACCESS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Drifted Twitter Spam Classification Using Multiscale Detection Test on K-L Divergence

Journal

IEEE ACCESS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper