BioC implementations in Go, Perl, Python and Ruby

Liu, W.; Islamaj Do an, R.; Kwon, D.; Marques, H.; Rinaldi, F.; Wilbur, W. J.; Comeau, D. C.

Published in

Oxford University Press, Database, 0(2014), p. bau059-bau059, 2014

DOI: 10.1093/database/bau059

Tools

Export citation

Search in Google Scholar

BioC implementations in Go, Perl, Python and Ruby

Journal article published in 2014 by W. Liu, R. Islamaj Do an

, D. Kwon, H. Marques, F. Rinaldi, W. J. Wilbur, D. C. Comeau

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

As part of a communitywide effort for evaluating text mining and information extraction systems applied to the biomedical domain, BioC is focused on the goal of interoperability, currently a major barrier to wide-scale adoption of text mining tools. BioC is a simple XML format, specified by DTD, for exchanging data for biomedical natural language processing. With initial implementations in C++ and Java, BioC provides libraries of code for reading and writing BioC text documents and annotations. We extend BioC to Perl, Python, Go and Ruby. We used SWIG to extend the C++ implementation for Perl and one Python implementation. A second Python implementation and the Ruby implementation use native data structures and libraries. BioC is also implemented in the Google language Go. BioC modules are functional in all of these languages, which can facilitate text mining tasks. BioC implementations are freely available through the BioC site: http://bioc.sourceforge.net.

Published in

Links

Tools

BioC implementations in Go, Perl, Python and Ruby

Abstract