Deep Learning with Limited Data: Organ Segmentation Performance by U-Net

Bardis, Michelle; Houshyar, Roozbeh; Chantaduly, Chanon; Ushinsky, Alexander; Glavis-Bloom, Justin; Shaver, Madeleine; Chow, Daniel; Uchio, Edward; Chang, Peter

Published in

MDPI, Electronics, 8(9), p. 1199, 2020

DOI: 10.3390/electronics9081199

Tools

Export citation

Search in Google Scholar

Deep Learning with Limited Data: Organ Segmentation Performance by U-Net

Journal article published in 2020 by Michelle Bardis

, Roozbeh Houshyar

, Chanon Chantaduly, Alexander Ushinsky, Justin Glavis-Bloom

, Madeleine Shaver, Daniel Chow, Edward Uchio, Peter Chang

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

(1) Background: The effectiveness of deep learning artificial intelligence depends on data availability, often requiring large volumes of data to effectively train an algorithm. However, few studies have explored the minimum number of images needed for optimal algorithmic performance. (2) Methods: This institutional review board (IRB)-approved retrospective review included patients who received prostate magnetic resonance imaging (MRI) between September 2014 and August 2018 and a magnetic resonance imaging (MRI) fusion transrectal biopsy. T2-weighted images were manually segmented by a board-certified abdominal radiologist. Segmented images were trained on a deep learning network with the following case numbers: 8, 16, 24, 32, 40, 80, 120, 160, 200, 240, 280, and 320. (3) Results: Our deep learning network’s performance was assessed with a Dice score, which measures overlap between the radiologist’s segmentations and deep learning-generated segmentations and ranges from 0 (no overlap) to 1 (perfect overlap). Our algorithm’s Dice score started at 0.424 with 8 cases and improved to 0.858 with 160 cases. After 160 cases, the Dice increased to 0.867 with 320 cases. (4) Conclusions: Our deep learning network for prostate segmentation produced the highest overall Dice score with 320 training cases. Performance improved notably from training sizes of 8 to 120, then plateaued with minimal improvement at training case size above 160. Other studies utilizing comparable network architectures may have similar plateaus, suggesting suitable results may be obtainable with small datasets.

Published in

Links

Tools

Deep Learning with Limited Data: Organ Segmentation Performance by U-Net

Abstract