Published in

Public Library of Science, PLoS Computational Biology, 11(16), p. e1008399, 2020

DOI: 10.1371/journal.pcbi.1008399

Links

Tools

Export citation

Search in Google Scholar

Transfer learning enables prediction of CYP2D6 haplotype function

This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Green circle
Postprint: archiving allowed
Green circle
Published version: archiving allowed
Data provided by SHERPA/RoMEO

Abstract

Cytochrome P450 2D6 (CYP2D6) is a highly polymorphic gene whose protein product metabolizes more than 20% of clinically used drugs. Genetic variations inCYP2D6are responsible for interindividual heterogeneity in drug response that can lead to drug toxicity and ineffective treatment, makingCYP2D6one of the most important pharmacogenes. Prediction of CYP2D6 phenotype relies on curation of literature-derived functional studies to assign a functional status toCYP2D6haplotypes. As the number of large-scale sequencing efforts grows, new haplotypes continue to be discovered, and assignment of function is challenging to maintain. To address this challenge, we have trained a convolutional neural network to predict functional status ofCYP2D6haplotypes, called Hubble.2D6. Hubble.2D6 predicts haplotype function from sequence data and was trained using two pre-training steps with a combination of real and simulated data. We find that Hubble.2D6 predictsCYP2D6haplotype functional status with 88% accuracy in a held-out test set and explains 47.5% of the variance inin vitrofunctional data among star alleles with unknown function. Hubble.2D6 may be a useful tool for assigning function to haplotypes with uncurated function, and used for screening individuals who are at risk of being poor metabolizers.