Published in

Oxford University Press, Bioinformatics, 24(32), p. 3850-3851, 2016

DOI: 10.1093/bioinformatics/btw559

Links

Tools

Export citation

Search in Google Scholar

AlmostSignificant: simplifying quality control of high-throughput sequencing data

Journal article published in 2016 by Joseph Ward ORCID, Christian Cole ORCID, Melanie Febrer, Geoffrey J. Barton ORCID
This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Abstract Motivation: The current generation of DNA sequencing technologies produce a large amount of data quickly. All of these data need to pass some form of quality control (QC) processing and checking before they can be used for any analysis. The large number of samples that are run through Illumina sequencing machines makes the process of QC an onerous and time-consuming task that requires multiple pieces of information from several sources. Results: AlmostSignificant is an open-source platform for aggregating multiple sources of quality metrics as well as run and sample meta-data associated with DNA sequencing runs from Illumina sequencing machines. AlmostSignificant is a graphical platform to streamline the QC of DNA sequencing data, to store these data for future reference together with extra meta-data associated with the sequencing runs not typically retained. This simplifies the challenge of monitoring the volume of data produced by Illumina sequencers. AlmostSignificant has been used to track the quality of over 80 sequencing runs covering over 2500 samples produced over the last three years. Availability and Implementation: The code and documentation for AlmostSignificant is freely available at https://github.com/bartongroup/AlmostSignificant. Contacts: c.cole@dundee.ac.uk or g.j.barton@dundee.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online.