Prediction of protein–ligand binding affinity from sequencing data with interpretable machine learning

Rube, H. Tomas; Rastogi, Chaitanya; Feng, Siqian; Kribelbauer, Judith F.; Li, Allyson; Becerra, Basheer; Melo, Lucas A. N.; Do, Bach Viet; Li, Xiaoting; Adam, Hammaad H.; Shah, Neel H.; Mann, Richard S.; Bussemaker, Harmen J.

Published in

Nature Research, Nature Biotechnology, 40(10), p. 1520-1527, 2022

DOI: 10.1038/s41587-022-01307-0

Abstract

Protein–ligand interactions are increasingly profiled at high throughput using affinity selection and massively parallel sequencing. However, these assays do not provide the biophysical parameters that most rigorously quantify molecular interactions. Here we describe a flexible machine learning method, called ProBound, that accurately defines sequence recognition in terms of equilibrium binding constants or kinetic rates. This is achieved using a multi-layered maximum-likelihood framework that models both the molecular interactions and the data generation process. We show that ProBound quantifies transcription factor (TF) behavior with models that predict binding affinity over a range exceeding that of previous resources; captures the impact of DNA modifications and conformational flexibility of multi-TF complexes; and infers specificity directly from in vivo data such as ChIP-seq without peak calling. When coupled with an assay called K_D-seq, it determines the absolute affinity of protein–ligand interactions. We also apply ProBound to profile the kinetics of kinase–substrate interactions. ProBound opens new avenues for decoding biological networks and rationally engineering protein–ligand interactions.
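
To make the idea of describing sequence recognition through equilibrium binding constants more concrete, the following is a minimal sketch and not ProBound's implementation. It assumes a simple additive model in which each base contributes an independent free-energy penalty, and a single binding site per ligand. The matrix values and the names `DDG`, `relative_affinity`, and `fraction_bound` are hypothetical and chosen only for illustration.

```python
import math

# Hypothetical free-energy penalties (in units of RT) for each position and base,
# measured relative to the best base at that position (penalty 0.0).
# These numbers are illustrative only and do not come from the paper.
DDG = [
    {"A": 0.0, "C": 1.5, "G": 2.0, "T": 1.0},
    {"A": 2.0, "C": 0.0, "G": 1.0, "T": 2.5},
    {"A": 1.0, "C": 2.0, "G": 0.0, "T": 1.5},
]


def relative_affinity(site):
    """Affinity of `site` relative to the optimal site, assuming additive energies."""
    if len(site) != len(DDG):
        raise ValueError("site length must match the energy matrix")
    total_penalty = sum(DDG[i][base] for i, base in enumerate(site))
    return math.exp(-total_penalty)


def fraction_bound(site, protein_conc, kd_optimal):
    """Equilibrium occupancy of one site: [P] / ([P] + K_D)."""
    # A weaker site has a proportionally larger dissociation constant.
    kd = kd_optimal / relative_affinity(site)
    return protein_conc / (protein_conc + kd)
```

Under these assumptions, `fraction_bound("ACG", 10.0, 10.0)` evaluates to 0.5, because the protein concentration equals the dissociation constant of the optimal site. A site with one mismatch, such as `"TCG"`, has a lower relative affinity and therefore a lower occupancy at the same concentration.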
