4.6 Article

Predicting Influential Blogger's by a Novel, Hybrid and Optimized Case Based Reasoning Approach With Balanced Random Forest Using Imbalanced Data

Journal

IEEE ACCESS
Volume 9, Issue -, Pages 6836-6854

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2020.3048610

Keywords

Blogger classification; case based reasoning (CBR); machine learning (ML); imbalanced data; text mining

Funding

  1. Department of Health Administration, Governors State University, USA
  2. COMSATS University Islamabad (CUI), Islamabad, Pakistan [CUI/ORIC-PD/20]

Ask authors/readers for more resources

Influential bloggers have the ability to understand and influence mass psychology, which can be beneficial for corporations looking to disseminate products or services quickly and effectively. This paper proposes a Framework for Influential Blogger Prediction using Case-Based Reasoning, which outperforms existing techniques in predicting and classifying influential bloggers. The CBR approach is found to optimize baseline techniques for better influential blogger identification.
Bloggers possess the capability of understanding and influencing mass psychology to a wide community of fans and followers by posting their online valuable content. Their dominance over audience can be used as a helping hand in the corporate world which desires to disseminate their product or services among diversified people belonging to varying localities, and is always on the lookout for suitable and quick ways to grasp public access. Due to this reason, influential bloggers are preferred in the online market to initiate marketing campaigns which is a thought-provoking task due to loads of blogger communities. The novelty of this paper lies in the proposed Framework for Influential Blogger Prediction based on Blogger and Blog Features (IBP-BBF) using Case-Based Reasoning (CBR) which is not only capable of handling labeled data but also unstructured data (blogs) and imbalanced data in an optimized way. Detailed labelled and unstructured data are collected by online survey of 129 bloggers and text mining of their 32,200 blogs respectively. The classification results are compared and validated with state-of-the-art machine learning techniques by using standard evaluation measures respectively in the context of imbalanced data. The results show that the proposed IBP-BBF framework through CBR modeling outperforms existing techniques in classifying and adapting the influential blogger prediction. The IBP-BBF framework performed better as compared to baseline imbalanced data classification techniques. It is found that the Balanced Random Forest contributes towards the performance of CBR approach than Balanced Bagging Classifier and RUSBoost classifier. By using the CBR approach, baseline techniques can be optimized for influential blogger identification in a better way.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available