Storing and manipulating environmental big data with JASMIN

Lawrence, B. N.; Bennett, V. L.; Churchill, J.; Juckes, M.; Kershaw, Philip; Pascoe, Stephen; Pepler, Sam; Pritchard, M.; Stephens, A.

Published in

2013 IEEE International Conference on Big Data

DOI: 10.1109/bigdata.2013.6691556

Proceedings article published in 2013 by B. N. Lawrence, V. L. Bennett, J. Churchill, M. Juckes, Philip Kershaw, Stephen Pascoe, Sam Pepler, M. Pritchard, A. Stephens

Abstract

JASMIN is a super-data-cluster designed to provide a high-performance, high-volume data analysis environment for the UK environmental science community. Thus far JASMIN has been used primarily by the atmospheric science and earth observation communities, both to support their direct scientific workflow and to support the curation of data products in the STFC Centre for Environmental Data Archival (CEDA). Initial JASMIN configuration and first experiences are reported here. Useful improvements in scientific workflow are presented. It is clear from the explosive growth in stored data and use that there was a pent-up demand for a suitable big-data analysis environment. This demand is not yet satisfied, in part because JASMIN does not yet have enough compute, the storage is fully allocated, and not all software needs are met. Plans to address these constraints are introduced.
