Published in

Wiley, Molecular Informatics, 10(31), p. 741-751, 2012

DOI: 10.1002/minf.201200053

Links

Tools

Export citation

Search in Google Scholar

A kernel-based method for assessing uncertainty on individual QSAR predictions

Journal article published in 2012 by Cléo Tebby ORCID, Enrico Mombelli
This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

The assessment of uncertainty attached to individual predictions is now a priority for sound decision-making in risk assessment. QSAR predictive uncertainty is affected by a variety of factors related to the quality of the training set data, the adopted statistical models, and the distance between the query chemical and the training set. We developed a method to quantify uncertainty associated with individual linear QSAR predictions that integrates both model and experimental error uncertainty and that defines an applicability domain based on the density of training set data. Our method is based on chemical spaces defined by latent variables identified by Partial Least Squares (PLS) regressions. The method provides a kernel regression estimate of the activity of interest as well as a measure of predictive uncertainty based on a mathematical estimation of the domain of applicability and on local propagation of uncertainty associated with training set data.