Unconstrained mining of transcript data reveals increased alternative splicing complexity in the human transcriptome

Mollet, I. G.; Ben-Dov, Claudia; Felício-Silva, Daniel; Ar, Grosso; Grosso, A. R.; Eleutério, Pedro; Alves, Ruben; Staller, Ray; Silva, Tito Santos; Carmo-Fonseca, Maria

Published in

Oxford University Press, Nucleic Acids Research, 14(38), p. 4740-4754, 2010

DOI: 10.1093/nar/gkq197

Tools

Export citation

Search in Google Scholar

Unconstrained mining of transcript data reveals increased alternative splicing complexity in the human transcriptome

Journal article published in 2010 by I. G. Mollet, Claudia Ben-Dov, Daniel Felício-Silva, Grosso Ar, A. R. Grosso

, Pedro Eleutério, Ruben Alves, Ray Staller, Tito Santos Silva, Maria Carmo-Fonseca

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

Mining massive amounts of transcript data for alternative splicing information is paramount to help understand how the maturation of RNA regulates gene expression. We developed an algorithm to cluster transcript data to annotated genes to detect unannotated splice variants. A higher number of alternatively spliced genes and isoforms were found compared to other alternative splicing databases. Comparison of human and mouse data revealed a marked increase, in human, of splice variants incorporating novel exons and retained introns. Previously unannotated exons were validated by tiling array expression data and shown to correspond preferentially to novel first exons. Retained introns were validated by tiling array and deep sequencing data. The majority of retained introns were shorter than 500 nt and had weak polypyrimidine tracts. A subset of retained introns matching small RNAs and displaying a high GC content suggests a possible coordination between splicing regulation and production of noncoding RNAs. Conservation of unannotated exons and retained introns was higher in horse, dog and cow than in rodents, and 64% of exon sequences were only found in primates. This analysis highlights previously bypassed alternative splice variants, which may be crucial to deciphering more complex pathways of gene regulation in human.

Published in

Links

Tools

Unconstrained mining of transcript data reveals increased alternative splicing complexity in the human transcriptome

Abstract