4.5 Article

A Data Quality Multidimensional Model for Social Media Analysis

Journal

Publisher

SPRINGER VIEWEG-SPRINGER FACHMEDIEN WIESBADEN GMBH
DOI: 10.1007/s12599-023-00840-9

Keywords

Data quality; Social media data; Business intelligence; Text analytics

Ask authors/readers for more resources

Social media platforms have become a valuable source of information for companies, and building high-quality datasets for social business intelligence requires analyzing data quality and developing practical business intelligence solutions. This paper presents a method that uses a multidimensional data model to construct cubes with various quality metrics, allowing for dynamic evaluation of quality indicators. The authors have applied this method to the automotive business and natural disasters management domains, demonstrating that data filtering can be easily performed by ranking posts according to quality metrics.
Social media platforms have become a new source of useful information for companies. Ensuring the business value of social media first requires an analysis of the quality of the relevant data and then the development of practical business intelligence solutions. This paper aims at building high-quality datasets for social business intelligence (SoBI). The proposed method offers an integrated and dynamic approach to identify the relevant quality metrics for each analysis domain. This method employs a novel multidimensional data model for the construction of cubes with impact measures for various quality metrics. In this model, quality metrics and indicators are organized in two main axes. The first one concerns the kind of facts to be extracted, namely: posts, users, and topics. The second axis refers to the quality perspectives to be assessed, namely: credibility, reputation, usefulness, and completeness. Additionally, quality cubes include a user-role dimension so that quality metrics can be evaluated in terms of the user business roles. To demonstrate the usefulness of this approach, the authors have applied their method to two separate domains: automotive business and natural disasters management. Results show that the trade-off between quantity and quality for social media data is focused on a small percentage of relevant users. Thus, data filtering can be easily performed by simply ranking the posts according to the quality metrics identified with the proposed method. As far as the authors know, this is the first approach that integrates both the extraction of analytical facts and the assessment of social media data quality in the same framework.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available