Evaluation of Machine Learning Methods Developed for Prediction of Diabetes Complications: A Systematic Review

Tan, Kuo Ren; Seng, Jun Jie Benjamin; Kwan, Yu Heng; Chen, Ying Jie; Zainudin, Sueziani Binte; Loh, Dionne Hui Fang; Liu, Nan; Low, Lian Leng

Published in

SAGE Publications, Journal of Diabetes Science and Technology, 2(17), p. 474-489, 2021

DOI: 10.1177/19322968211056917

Tools

Export citation

Search in Google Scholar

Evaluation of Machine Learning Methods Developed for Prediction of Diabetes Complications: A Systematic Review

Journal article published in 2021 by Kuo Ren Tan

, Jun Jie Benjamin Seng

, Yu Heng Kwan

, Ying Jie Chen

, Sueziani Binte Zainudin

, Dionne Hui Fang Loh

, Nan Liu

, Lian Leng Low

This paper was not found in any repository, but could be made available legally by the author.

Full text: Unavailable

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving forbidden

Policy details

Data provided by

Abstract

Background: With the rising prevalence of diabetes, machine learning (ML) models have been increasingly used for prediction of diabetes and its complications, due to their ability to handle large complex data sets. This study aims to evaluate the quality and performance of ML models developed to predict microvascular and macrovascular diabetes complications in an adult Type 2 diabetes population. Methods: A systematic review was conducted in MEDLINE®, Embase®, the Cochrane® Library, Web of Science®, and DBLP Computer Science Bibliography databases according to the PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) checklist. Studies that developed or validated ML prediction models for microvascular or macrovascular complications in people with Type 2 diabetes were included. Prediction performance was evaluated using area under the receiver operating characteristic curve (AUC). An AUC >0.75 indicates clearly useful discrimination performance, while a positive mean relative AUC difference indicates better comparative model performance. Results: Of 13 606 articles screened, 32 studies comprising 87 ML models were included. Neural networks (n = 15) were the most frequently utilized. Age, duration of diabetes, and body mass index were common predictors in ML models. Across predicted outcomes, 36% of the models demonstrated clearly useful discrimination. Most ML models reported positive mean relative AUC compared with non-ML methods, with random forest showing the best overall performance for microvascular and macrovascular outcomes. Majority (n = 31) of studies had high risk of bias. Conclusions: Random forest was found to have the overall best prediction performance. Current ML prediction models remain largely exploratory, and external validation studies are required before their clinical implementation. Protocol Registration: Open Science Framework (registration number: 10.17605/OSF.IO/UP49X).

Published in

Links

Tools

Evaluation of Machine Learning Methods Developed for Prediction of Diabetes Complications: A Systematic Review

Abstract