Article

A Data Quality in Use model for Big Data

Publisher

ELSEVIER
DOI: 10.1016/j.future.2015.11.024

Keywords

Data Quality; Big Data; Measurement; Quality-in-Use; Model

Funding

  1. GEODAS-BC project (Ministerio de Economía y Competitividad)
  2. GEODAS-BC project (Fondo Europeo de Desarrollo Regional FEDER) [TIN2012-37493-C03-01]
  3. SERENIDAD project (Consejería de Educación, Ciencia y Cultura de la Junta de Comunidades de Castilla La Mancha)
  4. SERENIDAD project (Fondo Europeo de Desarrollo Regional FEDER) [PEII11-0327-7035]


Beyond the hype of Big Data, something within business intelligence projects is indeed changing. This is mainly because Big Data is not only about data, but also about a complete conceptual and technological stack including raw and processed data, storage, ways of managing data, processing and analytics. A challenge that becomes even trickier is managing the quality of the data in Big Data environments. More than ever before, assessing Quality-in-Use gains importance, since the real contribution of data (its business value) can only be estimated in its context of use. Although different Data Quality models exist for assessing the quality of regular data, none of them has been adapted to Big Data. To fill this gap, we propose the 3As Data Quality-in-Use model, which is composed of three Data Quality characteristics for assessing the levels of Data Quality-in-Use in Big Data projects: Contextual Adequacy, Operational Adequacy and Temporal Adequacy. The model can be integrated into any sort of Big Data project, as it is independent of any pre-conditions or technologies. The paper shows how to use the model with a working example. The model addresses every challenge related to a Data Quality program aimed at Big Data. The main conclusion is that the model can be used as an appropriate way to obtain the Quality-in-Use levels of the input data of a Big Data analysis, and those levels can be understood as indicators of the trustworthiness and soundness of the results of that analysis. © 2015 Elsevier B.V. All rights reserved.
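
As an illustration only, the three characteristics named in the abstract could be recorded per dataset roughly as follows. This is a minimal sketch, not the paper's measurement procedure: the abstract does not say how levels are scored or combined, so the 1 to 5 scale, the class name ThreeAsAssessment, and the "weakest characteristic" aggregation rule are all assumptions made for the example.

```python
from dataclasses import dataclass

# Assumed scale: the abstract does not define how levels are expressed.
MIN_LEVEL, MAX_LEVEL = 1, 5


@dataclass(frozen=True)
class ThreeAsAssessment:
    """Hypothetical record of the three 3As characteristics for one dataset."""

    contextual_adequacy: int
    operational_adequacy: int
    temporal_adequacy: int

    def __post_init__(self) -> None:
        # Reject values outside the assumed scale.
        for name, value in vars(self).items():
            if not MIN_LEVEL <= value <= MAX_LEVEL:
                raise ValueError(
                    f"{name} must be in [{MIN_LEVEL}, {MAX_LEVEL}], got {value}"
                )

    def overall_level(self) -> int:
        # Assumed aggregation rule: the dataset is only as adequate as its
        # weakest characteristic. The paper may combine levels differently.
        return min(
            self.contextual_adequacy,
            self.operational_adequacy,
            self.temporal_adequacy,
        )


assessment = ThreeAsAssessment(
    contextual_adequacy=4,
    operational_adequacy=3,
    temporal_adequacy=5,
)
print(assessment.overall_level())
```

Taking the minimum is a deliberately conservative choice: a dataset that is timely and well suited to its context but hard to process would still be flagged before its analysis results are trusted.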
