Measuring the performance of prediction models to personalize treatment choice

Efthimiou, Orestis; Hoogland, Jeroen; Debray, Thomas P. A.; Seo, Michael; Furukawa, Toshiaki A.; Egger, Matthias; White, Ian R.

Published in

Wiley, Statistics in Medicine, 8(42), p. 1188-1206, 2023

DOI: 10.1002/sim.9665

Tools

Export citation

Search in Google Scholar

Measuring the performance of prediction models to personalize treatment choice

Journal article published in 2023 by Orestis Efthimiou

, Jeroen Hoogland

, Thomas P. A. Debray

, Michael Seo

, Toshiaki A. Furukawa, Matthias Egger, Ian R. White

This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Preprint: archiving allowed

Upload

Postprint: archiving restricted

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

When data are available from individual patients receiving either a treatment or a control intervention in a randomized trial, various statistical and machine learning methods can be used to develop models for predicting future outcomes under the two conditions, and thus to predict treatment effect at the patient level. These predictions can subsequently guide personalized treatment choices. Although several methods for validating prediction models are available, little attention has been given to measuring the performance of predictions of personalized treatment effect. In this article, we propose a range of measures that can be used to this end. We start by defining two dimensions of model accuracy for treatment effects, for a single outcome: discrimination for benefit and calibration for benefit. We then amalgamate these two dimensions into an additional concept, decision accuracy, which quantifies the model's ability to identify patients for whom the benefit from treatment exceeds a given threshold. Subsequently, we propose a series of performance measures related to these dimensions and discuss estimating procedures, focusing on randomized data. Our methods are applicable for continuous or binary outcomes, for any type of prediction model, as long as it uses baseline covariates to predict outcomes under treatment and control. We illustrate all methods using two simulated datasets and a real dataset from a trial in depression. We implement all methods in the R package predieval. Results suggest that the proposed measures can be useful in evaluating and comparing the performance of competing models in predicting individualized treatment effect.

Published in

Links

Tools

Measuring the performance of prediction models to personalize treatment choice

Abstract