Deep learning from multiple experts improves identification of amyloid neuropathologies

Wong, Daniel R.; Tang, Ziqi; Mew, Nicholas C.; Das, Sakshi; Athey, Justin; McAleese, Kirsty E.; Kofler, Julia K.; Flanagan, Margaret E.; Borys, Ewa; White, Charles L.; Butte, Atul J.; Dugger, Brittany N.; Keiser, Michael J.

Published in

BioMed Central, Acta Neuropathologica Communications, 1(10), 2022

DOI: 10.1186/s40478-022-01365-0

Tools

Export citation

Search in Google Scholar

Deep learning from multiple experts improves identification of amyloid neuropathologies

Journal article published in 2022 by Daniel R. Wong

, Ziqi Tang

, Nicholas C. Mew

, Sakshi Das, Justin Athey, Kirsty E. McAleese

, Julia K. Kofler, Margaret E. Flanagan, Ewa Borys, Charles L. White

, Atul J. Butte

, Brittany N. Dugger

, Michael J. Keiser

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

AbstractPathologists can label pathologies differently, making it challenging to yield consistent assessments in the absence of one ground truth. To address this problem, we present a deep learning (DL) approach that draws on a cohort of experts, weighs each contribution, and is robust to noisy labels. We collected 100,495 annotations on 20,099 candidate amyloid beta neuropathologies (cerebral amyloid angiopathy (CAA), and cored and diffuse plaques) from three institutions, independently annotated by five experts. DL methods trained on a consensus-of-two strategy yielded 12.6–26% improvements by area under the precision recall curve (AUPRC) when compared to those that learned individualized annotations. This strategy surpassed individual-expert models, even when unfairly assessed on benchmarks favoring them. Moreover, ensembling over individual models was robust to hidden random annotators. In blind prospective tests of 52,555 subsequent expert-annotated images, the models labeled pathologies like their human counterparts (consensus model AUPRC = 0.74 cored; 0.69 CAA). This study demonstrates a means to combine multiple ground truths into a common-ground DL model that yields consistent diagnoses informed by multiple and potentially variable expert opinions.

Published in

Links

Tools

Deep learning from multiple experts improves identification of amyloid neuropathologies

Abstract