4.4 Article

VIG: Data scaling for OBDA benchmarks

Journal

SEMANTIC WEB
Volume 10, Issue 2, Pages 413-433

Publisher

IOS PRESS
DOI: 10.3233/SW-180336

Keywords

Data scaling; OBDA; benchmarking

Funding

  1. project Ontology-based analysis of temporal and streaming data (OBATS) through the 2017 research budget of the Free University of Bozen-Bolzano
  2. project Ontology-based analysis of temporal and streaming data (OBATS) through the 2018 research budget of the Free University of Bozen-Bolzano
  3. project High Quality Data Integration with Ontologies (QUADRO) funded through the 2017 research budget of the Free University of Bozen-Bolzano
  4. project High Quality Data Integration with Ontologies (QUADRO) funded through the 2018 research budget of the Free University of Bozen-Bolzano
  5. project South Tyrol Longitudinal study on Longevity and Ageing (STyLoLa) through the 2017 Interdisciplinary Call

Ask authors/readers for more resources

In this paper we describe VIG, a data scaler for Ontology-Based Data Access (OBDA) benchmarks. Data scaling is a relatively recent approach, proposed in the database community, that allows for quickly scaling an input data instance to s times its size, while preserving certain application-specific characteristics. The advantages of the scaling approach are that the same generator is general, in the sense that it can be re-used on different database schemas, and that users are not required to manually input the data characteristics. In the VIG system, we lift the scaling approach from the pure database level to the OBDA level, where the domain information of ontologies and mappings has to be taken into account as well. VIG is efficient and notably each tuple is generated in constant time. To evaluate VIG, we have carried out an extensive set of experiments with three datasets (BSBM, DBLP, and NPD), using two OBDA systems (Ontop and D2RQ), backed by two relational database engines (MySQL and PostgreSQL), and compared with real-world data, ad-hoc data generators, and random data generators. The encouraging results show that the data scaling performed by VIG is efficient and that the scaled data are suitable for benchmarking OBDA systems.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available