Dissemin is shutting down on January 1st, 2025

Published in

2012 IEEE 8th International Conference on E-Science

DOI: 10.1109/escience.2012.6404477

Links

Tools

Export citation

Search in Google Scholar

Temporal Representation for Scientific Data Provenance

Proceedings article published in 2012 by Peng Chen, Beth Plale ORCID, Mehmet S. Aktas
This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Provenance of digital scientific data is an important piece of the metadata of a data object. It can however grow voluminous quickly because the granularity level of capture can be high. It can also be quite feature rich. We propose a representation of the provenance data based on logical time that reduces the feature space. Creating time and frequency domain representations of the provenance, we apply clustering, classifica-tion and association rule mining to the abstract representations to determine the usefulness of the temporal representation. We evaluate the temporal representation using an existing 10 GB database of provenance captured from a range of scientific workflows.