Published in

Springer, Lecture Notes in Computer Science, p. 452-463, 2014

DOI: 10.1007/978-3-319-14325-5_39

Links

Tools

Export citation

Search in Google Scholar

A semantic-based approach to attain reproducibility of computational environments in scientific workflows: a case study

This paper is available in a repository.
This paper is available in a repository.

Full text: Download

Red circle
Preprint: archiving forbidden
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Reproducible research in scientific workflows is often ad-dressed by tracking the provenance of the produced results. While this approach allows inspecting intermediate and final results, improves un-derstanding, and permits replaying a workflow execution, it does not ensure that the computational environment is available for subsequent executions to reproduce the experiment. In this work, we propose de-scribing the resources involved in the execution of an experiment using a set of semantic vocabularies, so as to conserve the computational envi-ronment. We define a process for documenting the workflow application, management system, and their dependencies based on 4 domain ontolo-gies. We then conduct an experimental evaluation using a real workflow application on an academic and a public Cloud platform. Results show that our approach can reproduce an equivalent execution environment of a predefined virtual machine image on both computing platforms.