4.4 Article Proceedings Paper

Tracking provenance in a virtual data grid

Journal

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE
Volume 20, Issue 5, Pages 565-575

Publisher

JOHN WILEY & SONS LTD
DOI: 10.1002/cpe.1256

Keywords

grid computing; workflow; data provenance

Ask authors/readers for more resources

The virtual data model allows data sets to be described prior to, and separately from, their physical materialization. We have implemented this model in a Virtual Data Language (VDL) and associated supporting tools, which provide for both the storage, query, and retrieval of virtual data set descriptions, and the automated, on-demand materialization of virtual data sets. We use a standardized data provenance challenge exercise to illustrate the powerful queries that can be performed on the data maintained by these tools, which for a single virtual data set can include three elements: the computational procedure(s) that must be executed to materialize the data set, the runtime log(s) produced by the execution of the computation(s), and optional metadata annotation(s) that associate application semantics with data and procedures. Copyright (C) 2007 John Wiley & Sons, Ltd.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.4
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available