Most Highly Expressed Protein-Coding Genes Have a Single Dominant Isoform

de Santa Pau, Enrique Carillo; Ezkurdia, Iakes; Jm, Rodriguez; Rodriguez, Jose Manuel; Carrillo-De Santa Pau, Enrique; Vázquez, Jesús; Valencia, Alfonso; Ml, Tress; Tress, Michael Liam

Published in

American Chemical Society, Journal of Proteome Research, 4(14), p. 1880-1887, 2015

DOI: 10.1021/pr501286b

Tools

Export citation

Search in Google Scholar

Most Highly Expressed Protein-Coding Genes Have a Single Dominant Isoform

Journal article published in 2015 by Enrique Carillo de Santa Pau, Iakes Ezkurdia, Rodriguez Jm, Jose Manuel Rodriguez, Enrique Carrillo-De Santa Pau, Jesús Vázquez

, Alfonso Valencia, Tress Ml, Michael Liam Tress

This paper is available in a repository.

Full text: Download

Preprint: archiving allowed

Must obtain written permission from Editor
Must not violate ACS ethical Guidelines

Upload

Postprint: archiving restricted

Must obtain written permission from Editor
Must not violate ACS ethical Guidelines

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Although eukaryotic cells express a wide range of alternatively spliced transcripts, it is not clear whether genes tend to express a range of transcripts simultaneously across cells, or whether most genes produce dominant isoforms, either in a tissue-specific manner or regardless of tissue. To date large-scale investigations into the pattern of transcript expression across distinct tissues have produced contradictory results. Here we attempt to determine whether genes express a dominant splice variant, but at the protein level. We interrogate peptides from 8 large-scale human proteomics experiments and databases and find that there is a single dominant protein isoform, irrespective of tissue or cell type, for the vast majority of the protein-coding genes in these experiments, in partial agreement with the conclusions from the most recent large-scale RNAseq study. Remarkably the dominant isoforms from the experimental proteomics analyses coincided overwhelmingly with the reference isoforms selected by two completely orthogonal sources, the CCDS consensus variants, which are agreed upon by separate manual genome curation teams, and the principal isoforms from the APPRIS database, predicted automatically from the conservation of protein sequence, structure and function.

Published in

Links

Tools

Most Highly Expressed Protein-Coding Genes Have a Single Dominant Isoform

Abstract