Published in

Wiley, Molecular Informatics, 7(32), p. 609-623, 2013

DOI: 10.1002/minf.201300016

Links

Tools

Export citation

Search in Google Scholar

Modelling structure activity landscapes with cliffs : a kernel regression-based approach

Journal article published in 2013 by Cléo Tebby ORCID, Enrico Mombelli
This paper is made freely available by the publisher.
This paper is made freely available by the publisher.

Full text: Download

Green circle
Preprint: archiving allowed
Orange circle
Postprint: archiving restricted
Red circle
Published version: archiving forbidden
Data provided by SHERPA/RoMEO

Abstract

Quantitative Structure-Activity Relationship (QSAR) models are increasingly used in hazard and risk assessment. Even when models with linear relationships between activity and a small number of descriptors are built and validated regarding predictivity and statistical assumptions, similar structures can exhibit large differences in activity known as similarity paradoxes or activity cliffs. In order to reduce the impact that similarity paradoxes can have on predictions we have devised a statistical method based on Nadaraya-Watson kernel regression. According to our method, activity cliffs filter out contributions of neighbouring chemicals especially along the cliff axis. Our method decreases density-based certainty in particular for chemicals with strong prediction errors and the implementation of Structure-Activity Landscape Index (SALI) curves shows that our method improves the prediction of activity cliff ranks. We also provide useful indications on the density-based applicability domain and the reliability of individual predictions.