Temporal Representation for Scientific Data Provenance

Chen, Peng; Plale, Beth; Aktas, Mehmet S.

Published in

2012 IEEE 8th International Conference on E-Science

DOI: 10.1109/escience.2012.6404477

Tools

Export citation

Search in Google Scholar

Temporal Representation for Scientific Data Provenance

Proceedings article published in 2012 by Peng Chen, Beth Plale

, Mehmet S. Aktas

This paper is available in a repository.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Provenance of digital scientific data is an important piece of the metadata of a data object. It can however grow voluminous quickly because the granularity level of capture can be high. It can also be quite feature rich. We propose a representation of the provenance data based on logical time that reduces the feature space. Creating time and frequency domain representations of the provenance, we apply clustering, classifica-tion and association rule mining to the abstract representations to determine the usefulness of the temporal representation. We evaluate the temporal representation using an existing 10 GB database of provenance captured from a range of scientific workflows.

Published in

Links

Tools

Temporal Representation for Scientific Data Provenance

Abstract