BED: a Biological Entity Dictionary based on a graph data model

Godard, Patrice; van Eyll, Jonathan

Published in

F1000Research, (7), p. 195, 2018

DOI: 10.12688/f1000research.13925.3

DOI: 10.12688/f1000research.13925.2

DOI: 10.12688/f1000research.13925.1

Journal article published in 2018 by Patrice Godard, Jonathan van Eyll

This paper is made freely available by the publisher.

Preprint: archiving forbidden

Postprint: archiving forbidden

Published version: archiving allowed

Abstract

The understanding of molecular processes involved in a specific biological system can be significantly improved by combining and comparing different data sets and knowledge resources. However, these information sources often use different identification systems, so an identifier conversion step is required before any integration effort. Mappings between identifiers are often provided by the reference information resources, and several tools have been implemented to simplify their use. However, most of these tools do not combine the information provided by individual resources to increase the completeness of the mapping process. Also, deprecated identifiers from former versions of databases are not taken into account. Finally, automatically finding the most relevant path to map identifiers from one scope to another is often not trivial. The Biological Entity Dictionary (BED) addresses these three challenges by relying on a graph data model describing possible relationships between entities and their identifiers. This model has been implemented using Neo4j, and an R package provides functions to query the graph as well as to create and populate a custom instance of the database. This design, combined with a local installation of the graph database and a cache system, makes BED very efficient at converting large lists of identifiers.
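The three challenges named in the abstract (combining resources, handling deprecated identifiers, and finding a conversion path between scopes) can be illustrated with a small graph traversal. The sketch below is written in Python for illustration only and is not BED's R API or its Neo4j schema: the scope names (`db_A`, `db_B`, `db_C`), the identifiers, the edge kinds (`deprecated_to`, `cross_ref`) and the functions `build_graph` and `convert` are all hypothetical. It assumes that cross-references can be followed in both directions, that deprecation links are followed only from the old identifier to the new one, and that the shortest path is an acceptable stand-in for the "most relevant" path.

```python
from collections import deque

# Toy data, entirely made up for illustration. Each node is a
# (scope, identifier) pair. Edge kinds are hypothetical, not BED's schema.
EDGES = [
    (("db_A", "A_old1"), ("db_A", "A1"), "deprecated_to"),  # deprecated identifier
    (("db_A", "A1"), ("db_B", "B1"), "cross_ref"),          # mapping from one resource
    (("db_B", "B1"), ("db_C", "C1"), "cross_ref"),          # mapping from another resource
    (("db_A", "A2"), ("db_C", "C2"), "cross_ref"),          # unrelated entity
]


def build_graph(edges):
    """Build an adjacency map. Cross-references are bidirectional;
    deprecation links point only from the old identifier to the new one."""
    graph = {}
    for src, dst, kind in edges:
        graph.setdefault(src, set()).add(dst)
        graph.setdefault(dst, set())
        if kind == "cross_ref":
            graph[dst].add(src)
    return graph


def convert(graph, start, target_scope):
    """Breadth-first search from `start`. Returns a dict mapping each
    reachable identifier in `target_scope` to one shortest path leading to it."""
    if start not in graph:
        return {}
    paths = {start: [start]}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for neighbour in graph[node]:
            if neighbour not in paths:
                paths[neighbour] = paths[node] + [neighbour]
                queue.append(neighbour)
    return {
        node[1]: path
        for node, path in paths.items()
        if node[0] == target_scope and node != start
    }


GRAPH = build_graph(EDGES)

if __name__ == "__main__":
    # Convert a deprecated db_A identifier to db_C by chaining two resources.
    for identifier, path in convert(GRAPH, ("db_A", "A_old1"), "db_C").items():
        print(identifier, "via", " -> ".join(f"{s}:{i}" for s, i in path))
```

In this toy graph no single resource links `db_A` to `db_C` for the first entity, so the conversion succeeds only because the mappings from both resources sit in the same graph, and the deprecated identifier is resolved on the way. A production system would delegate the traversal to the graph database rather than hold the graph in memory.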
