A robust model training strategy using hard negative mining in a weakly labeled dataset for lymphatic invasion in gastric cancer

Lee, Jonghyun; Ahn, Sangjeong; Kim, Hyun‐Soo; An, Jungsuk; Sim, Jongmin

Published in

Wiley Open Access, Journal of Pathology: Clinical Research, 1(10), 2023

DOI: 10.1002/cjp2.355

Tools

Export citation

Search in Google Scholar

A robust model training strategy using hard negative mining in a weakly labeled dataset for lymphatic invasion in gastric cancer

Journal article published in 2023 by Jonghyun Lee

, Sangjeong Ahn, Hyun‐Soo Kim, Jungsuk An, Jongmin Sim

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving allowed

Upload

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

AbstractGastric cancer is a significant public health concern, emphasizing the need for accurate evaluation of lymphatic invasion (LI) for determining prognosis and treatment options. However, this task is time‐consuming, labor‐intensive, and prone to intra‐ and interobserver variability. Furthermore, the scarcity of annotated data presents a challenge, particularly in the field of digital pathology. Therefore, there is a demand for an accurate and objective method to detect LI using a small dataset, benefiting pathologists. In this study, we trained convolutional neural networks to classify LI using a four‐step training process: (1) weak model training, (2) identification of false positives, (3) hard negative mining in a weakly labeled dataset, and (4) strong model training. To overcome the lack of annotated datasets, we applied a hard negative mining approach in a weakly labeled dataset, which contained only final diagnostic information, resembling the typical data found in hospital databases, and improved classification performance. Ablation studies were performed to simulate the lack of datasets and severely unbalanced datasets, further confirming the effectiveness of our proposed approach. Notably, our results demonstrated that, despite the small number of annotated datasets, efficient training was achievable, with the potential to extend to other image classification approaches used in medicine.

Published in

Links

Tools

A robust model training strategy using hard negative mining in a weakly labeled dataset for lymphatic invasion in gastric cancer

Abstract