Bioclojure: a functional library for the manipulation of biological sequences

Plieskatt, Jordan; Rinaldi, Gabriel; Brindley, Paul J.; Jia, Xinying; Potriquet, Jeremy; Bethony, Jeffrey; Mulvenna, Jason

Published in

Oxford University Press, Bioinformatics, 17(30), p. 2537-2539, 2014

DOI: 10.1093/bioinformatics/btu311

Tools

Export citation

Search in Google Scholar

Bioclojure: a functional library for the manipulation of biological sequences

Journal article published in 2014 by Jordan Plieskatt, Gabriel Rinaldi, Paul J. Brindley, Xinying Jia

, Jeremy Potriquet, Jeffrey Bethony, Jason Mulvenna

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving restricted

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Motivation: BioClojure is an open-source library for the manipulation of biological sequence data written in the language Clojure. BioClojure aims to provide a functional framework for the processing of biological sequence data that provides simple mechanisms for concurrency and lazy evaluation of large datasets. Results: BioClojure provides parsers and accessors for a range of biological sequence formats, including UniProtXML, Genbank XML, FASTA and FASTQ. In addition, it provides wrappers for key analysis programs, including BLAST, SignalP, TMHMM and InterProScan, and parsers for analyzing their output. All interfaces leverage Clojure’s functional style and emphasize laziness and composability, so that BioClojure, and user-defined, functions can be chained into simple pipelines that are thread-safe and seamlessly integrate lazy evaluation. Availability and implementation: BioClojure is distributed under the Lesser GPL, and the source code is freely available from GitHub (https://github.com/s312569/clj-biosequence). Contact: jason.mulvenna@qimrberghofer.edu.au or jason.mulvenna@qimr.edu.au

Published in

Links

Tools

Bioclojure: a functional library for the manipulation of biological sequences

Abstract