Evaluating the Cassandra NoSQL Database Approach for Genomic Data Persistency

Aniceto, Rodrigo; Xavier, Rene; Guimarães, Valeria; Hondo, Fernanda; Holanda, Maristela; Walter, Maria Emilia; Lifschitz, Sérgio

Published in

Hindawi, International Journal of Genomics, (2015), p. 1-7, 2015

DOI: 10.1155/2015/502795


This paper is made freely available by the publisher.


Abstract

Rapid advances in high-throughput sequencing techniques have created interesting computational challenges in bioinformatics. One of them is the management of the massive amounts of data generated by automatic sequencers. We need to deal with the persistency of genomic data, particularly the storage and analysis of these large-scale processed data. Finding an alternative to the commonly used relational database model therefore becomes a compelling task. Other data models may be more effective when dealing with very large amounts of nonconventional data, especially for write and retrieval operations. In this paper, we discuss the Cassandra NoSQL database approach for storing genomic data. We analyze persistency and I/O operations on real data using the Cassandra database system. We also compare the results with those obtained from a classical relational database system and from another NoSQL database approach, MongoDB.
