Published in

BioMed Central, Genome Biology, 10(1), p. 201

DOI: 10.1186/gb-2009-10-1-201

Identifying protein-coding genes in genomic sequences

This paper is made freely available by the publisher.

Preprint: archiving allowed
Postprint: archiving allowed
Published version: archiving allowed
Data provided by SHERPA/RoMEO

Abstract

The vast majority of the biology of a newly sequenced genome is inferred from the set of encoded proteins. Predicting this set is therefore invariably the first step after the completion of the genome DNA sequence. Here we review the main computational pipelines used to generate the human reference protein-coding gene sets.