Published in

Nature Research, npj Genomic Medicine, 1(5), 2020

DOI: 10.1038/s41525-020-00142-z

Links

Tools

Export citation

Search in Google Scholar

Next-generation sequencing of newborn screening genes: the accuracy of short-read mapping

This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Red circle
Postprint: archiving forbidden
Green circle
Published version: archiving allowed
Data provided by SHERPA/RoMEO

Abstract

AbstractNewborn screening programs are an integral part of public health systems aiming to save lives and improve the quality of life for infants with treatable disorders. Technological advancements have driven the expansion of newborn screening programs in the last two decades and the development of fast, accurate next-generation sequencing technology has opened the door to a range of possibilities in the field. However, technological challenges with short-read next-generation sequencing technologies remain significant in highly homologous genomic regions such as pseudogenes or paralogous genes and need to be considered when implemented in screening programs. Here, we simulate 50 genomes from populations around the world to test the extent to which high homology regions affect short-read mapping of genes related to newborn screening disorders and the impact of differential read lengths and ethnic backgrounds. We examine a 158 gene screening panel directly relevant to newborn screening and identify gene regions where read mapping is affected by homologous genomic regions at different read lengths. We also determine that the patient’s ethnic background does not have a widespread impact on mapping accuracy or coverage. Additionally, we identify newborn screening genes where alternative forms of sequencing or variant calling pipelines should be considered and demonstrate that alterations to standard variant calling can retrieve some formerly uncalled variants.