Published in

Oxford University Press, Bioinformatics, 19(34), p. 3396-3398, 2018

DOI: 10.1093/bioinformatics/bty382

Links

Tools

Export citation

Search in Google Scholar

MACARON: a python framework to identify and re-annotate multi-base affected codons in whole genome/exome sequence data

This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Abstract Summary Predicted deleteriousness of coding variants is a frequently used criterion to filter out variants detected in next-generation sequencing projects and to select candidates impacting on the risk of human diseases. Most available dedicated tools implement a base-to-base annotation approach that could be biased in presence of several variants in the same genetic codon. We here proposed the MACARON program that, from a standard VCF file, identifies, re-annotates and predicts the amino acid change resulting from multiple single nucleotide variants (SNVs) within the same genetic codon. Applied to the whole exome dataset of 573 individuals, MACARON identifies 114 situations where multiple SNVs within a genetic codon induce an amino acid change that is different from those predicted by standard single SNV annotation tool. Such events are not uncommon and deserve to be studied in sequencing projects with inconclusive findings. Availability and implementation MACARON is written in python with codes available on the GENMED website (www.genmed.fr). Supplementary information Supplementary data are available at Bioinformatics online.