Published in

Zenodo, 2018

DOI: 10.5281/zenodo.1561840

Digital Expression Explorer 2: a repository of 4.5 trillion uniformly processed RNA-seq reads and counting

Journal article published in 2018 by Mark Ziemann, Antony Kaspi, Assam El-Osta
This paper is made freely available by the publisher.

Abstract

Background: Transcriptome profiling by RNA-seq has enhanced scientific understanding of gene regulation. Despite the benefits these data have brought in terms of transcriptome coverage and accuracy, there are considerable barriers to entry for the novice computational biologist who wants to analyse these large data sets. There is a definite need for a repository of uniformly processed RNA-seq data that is easy to use and represents major model organisms.

Findings: To address these obstacles, we developed Digital Expression Explorer 2 (DEE2), a web-based repository of RNA-seq data in the form of gene-level and transcript-level expression counts. DEE2 contains over 400,000 RNA-seq data sets from several species including yeast, Arabidopsis, worm, fruit fly, zebrafish, rat, mouse and human. Base-space sequence data downloaded from the NCBI Sequence Read Archive underwent quality analysis, filtering and trimming prior to transcriptome and genome alignment and read counting using open-source tools. Uniform reference genomes and data processing methods ensure consistency across experiments, facilitating fast and reproducible meta-analyses.

Conclusions: The web interface enables users to quickly identify data sets of interest through accession number and keyword searches. These data can also be accessed programmatically using a specifically designed R script. We demonstrate that DEE2 data are compatible with statistical packages such as edgeR or DESeq. DEE2 can be found at http://dee2.io.
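To make concrete what "gene-level expression counts" compatible with packages such as edgeR or DESeq might look like, the following is a minimal Python sketch. It is not the DEE2 R script and does not reflect the DEE2 file format. It only assumes that each sequencing run supplies a mapping from gene identifier to read count, then merges those mappings into a genes-by-runs matrix and derives counts per million (CPM). The run names, gene names, counts and function names are all hypothetical.

```python
from typing import Dict, List, Tuple


def build_count_matrix(
    runs: Dict[str, Dict[str, int]]
) -> Tuple[List[str], List[str], List[List[int]]]:
    """Merge per-run gene counts into a genes x runs matrix.

    Genes absent from a run are filled with 0 so every row has one
    entry per run. Rows and columns are sorted for a stable layout.
    """
    run_ids = sorted(runs)
    genes = sorted({gene for counts in runs.values() for gene in counts})
    matrix = [[runs[run].get(gene, 0) for run in run_ids] for gene in genes]
    return genes, run_ids, matrix


def counts_per_million(matrix: List[List[int]]) -> List[List[float]]:
    """Scale each column (run) by its library size to counts per million."""
    n_runs = len(matrix[0]) if matrix else 0
    lib_sizes = [sum(row[j] for row in matrix) for j in range(n_runs)]
    return [
        [row[j] * 1e6 / lib_sizes[j] if lib_sizes[j] else 0.0 for j in range(n_runs)]
        for row in matrix
    ]


# Hypothetical runs and counts, invented purely for illustration.
example_runs = {
    "RUN_A": {"geneX": 10, "geneY": 90},
    "RUN_B": {"geneX": 30, "geneY": 50, "geneZ": 20},
}

genes, run_ids, matrix = build_count_matrix(example_runs)
cpm = counts_per_million(matrix)

for gene, raw, scaled in zip(genes, matrix, cpm):
    print(gene, raw, [round(value) for value in scaled])
```

A matrix with genes in rows and runs in columns is the general shape that count-based differential expression packages take as input. CPM is shown here only as a simple library-size adjustment, not as the normalisation used by the authors.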