Published in

BioMed Central, BMC Bioinformatics, 1(22), 2021

DOI: 10.1186/s12859-021-04153-0

Links

Tools

Export citation

Search in Google Scholar

DR2S: an integrated algorithm providing reference-grade haplotype sequences from heterozygous samples

Journal article published in 2021 by Steffen Klasberg ORCID, Alexander H. Schmidt, Vinzenz Lange, Gerhard Schöfl
This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Green circle
Published version: archiving allowed
Data provided by SHERPA/RoMEO

Abstract

Abstract Background High resolution HLA genotyping of donors and recipients is a crucially important prerequisite for haematopoetic stem-cell transplantation and relies heavily on the quality and completeness of immunogenetic reference sequence databases of allelic variation. Results Here, we report on , an R package that leverages the strengths of two sequencing technologies—the accuracy of next-generation sequencing with the read length of third-generation sequencing technologies like PacBio’s SMRT sequencing or ONT’s nanopore sequencing—to reconstruct fully-phased high-quality full-length haplotype sequences. Although optimised for HLA and KIR genes, is applicable to all loci with known reference sequences provided that full-length sequencing data is available for analysis. In addition, integrates supporting tools for easy visualisation and quality control of the reconstructed haplotype to ensure suitability for submission to public allele databases. Conclusions is a largely automated workflow designed to create high-quality fully-phased reference allele sequences for highly polymorphic gene regions such as HLA or KIR. It has been used by biologists to successfully characterise and submit more than 500 HLA alleles and more than 500 KIR alleles to the IPD-IMGT/HLA and IPD-KIR databases.