Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models

Ming, Chang; Viassolo, Valeria; Probst-Hensch, Nicole; Chappuis, Pierre O.; Dinov, Ivo D.; Katapodi, Maria C.

Published in

BioMed Central, Breast Cancer Research, 1(21), 2019

DOI: 10.1186/s13058-019-1158-4

Tools

Export citation

Search in Google Scholar

Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models

Journal article published in 2019 by Chang Ming

, Valeria Viassolo

, Nicole Probst-Hensch, Pierre O. Chappuis, Ivo D. Dinov, Maria C. Katapodi

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

Abstract Background Comprehensive breast cancer risk prediction models enable identifying and targeting women at high-risk, while reducing interventions in those at low-risk. Breast cancer risk prediction models used in clinical practice have low discriminatory accuracy (0.53–0.64). Machine learning (ML) offers an alternative approach to standard prediction modeling that may address current limitations and improve accuracy of those tools. The purpose of this study was to compare the discriminatory accuracy of ML-based estimates against a pair of established methods—the Breast Cancer Risk Assessment Tool (BCRAT) and Breast and Ovarian Analysis of Disease Incidence and Carrier Estimation Algorithm (BOADICEA) models. Methods We quantified and compared the performance of eight different ML methods to the performance of BCRAT and BOADICEA using eight simulated datasets and two retrospective samples: a random population-based sample of U.S. breast cancer patients and their cancer-free female relatives (N = 1143), and a clinical sample of Swiss breast cancer patients and cancer-free women seeking genetic evaluation and/or testing (N = 2481). Results Predictive accuracy (AU-ROC curve) reached 88.28% using ML-Adaptive Boosting and 88.89% using ML-random forest versus 62.40% with BCRAT for the U.S. population-based sample. Predictive accuracy reached 90.17% using ML-adaptive boosting and 89.32% using ML-Markov chain Monte Carlo generalized linear mixed model versus 59.31% with BOADICEA for the Swiss clinic-based sample. Conclusions There was a striking improvement in the accuracy of classification of women with and without breast cancer achieved with ML algorithms compared to the state-of-the-art model-based approaches. High-accuracy prediction techniques are important in personalized medicine because they facilitate stratification of prevention strategies and individualized clinical management.

Published in

Links

Tools

Machine learning techniques for personalized breast cancer risk prediction: comparison with the BCRAT and BOADICEA models

Abstract