Skin Tone Analysis for Representation in Educational Materials (STAR-ED) using machine learning

Tadesse, Girmaw Abebe; Cintas, Celia; Varshney, Kush R.; Staar, Peter; Agunwa, Chinyere; Speakman, Skyler; Jia, Justin; Bailey, Elizabeth E.; Adelekun, Ademide; Lipoff, Jules B.; Onyekaba, Ginikanwa; Lester, Jenna C.; Rotemberg, Veronica; Zou, James; Daneshjou, Roxana

Published in

Nature Research, npj Digital Medicine, 1(6), 2023

DOI: 10.1038/s41746-023-00881-0

Tools

Export citation

Search in Google Scholar

Skin Tone Analysis for Representation in Educational Materials (STAR-ED) using machine learning

Journal article published in 2023 by Girmaw Abebe Tadesse, Celia Cintas, Kush R. Varshney, Peter Staar, Chinyere Agunwa, Skyler Speakman, Justin Jia, Elizabeth E. Bailey, Ademide Adelekun, Jules B. Lipoff, Ginikanwa Onyekaba, Jenna C. Lester, Veronica Rotemberg

, James Zou

, Roxana Daneshjou

This paper is made freely available by the publisher.

Full text: Download

Preprint: archiving allowed

Upload

Postprint: archiving forbidden

Published version: archiving allowed

Upload

Policy details

Data provided by

Abstract

AbstractImages depicting dark skin tones are significantly underrepresented in the educational materials used to teach primary care physicians and dermatologists to recognize skin diseases. This could contribute to disparities in skin disease diagnosis across different racial groups. Previously, domain experts have manually assessed textbooks to estimate the diversity in skin images. Manual assessment does not scale to many educational materials and introduces human errors. To automate this process, we present the Skin Tone Analysis for Representation in EDucational materials (STAR-ED) framework, which assesses skin tone representation in medical education materials using machine learning. Given a document (e.g., a textbook in .pdf), STAR-ED applies content parsing to extract text, images, and table entities in a structured format. Next, it identifies images containing skin, segments the skin-containing portions of those images, and estimates the skin tone using machine learning. STAR-ED was developed using the Fitzpatrick17k dataset. We then externally tested STAR-ED on four commonly used medical textbooks. Results show strong performance in detecting skin images (0.96 ± 0.02 AUROC and 0.90 ± 0.06 F₁ score) and classifying skin tones (0.87 ± 0.01 AUROC and 0.91 ± 0.00 F₁ score). STAR-ED quantifies the imbalanced representation of skin tones in four medical textbooks: brown and black skin tones (Fitzpatrick V-VI) images constitute only 10.5% of all skin images. We envision this technology as a tool for medical educators, publishers, and practitioners to assess skin tone diversity in their educational materials.

Published in

Links

Tools

Skin Tone Analysis for Representation in Educational Materials (STAR-ED) using machine learning

Abstract