Data integration in biological research: An overview

Full text: Download

Publisher: BMC (part of Springer Nature)

Preprint: archiving allowed. Upload

Postprint: archiving allowed. Upload

Published version: archiving allowed. Upload

Policy details (opens in a new window). Data provided by SHERPA/RoMEO

Contact authors Contact

Abstract
Data sharing, integration and annotation are essential to ensure the reproducibility of the analysis and interpretation of the experimental findings. Often these activities are perceived as a role that bioinformaticians and computer scientists have to take with no or little input from the experimental biologist. On the contrary, biological researchers, being the producers and often the end users of such data, have a big role in enabling biological data integration. The quality and usefulness of data integration depend on the existence and adoption of standards, shared formats, and mechanisms that are suitable for biological researchers to submit and annotate the data, so it can be easily searchable, conveniently linked and consequently used for further biological analysis and discovery. Here, we provide background on what is data integration from a computational science point of view, how it has been applied to biological research, which key aspects contributed to its success and future directions.