A patient-centric dataset of images and metadata for identifying melanomas using clinical context

Veronica Rotemberg*, Nicholas Kurtansky, Brigid Betz-Stablein, Liam Caffery, Emmanouil Chousakos, Noel Codella, Marc Combalia, Stephen Dusza, Pascale Guitera, David Gutman, Allan Halpern, Brian Helba, Harald Kittler, Kivanc Kose, Steve Langer, Konstantinos Lioprys, Josep Malvehy, Shenara Musthaq, Jabpani Nanda, Ofer ReiterGeorge Shih, Alexander Stratigos, Philipp Tschandl, Jochen Weber, H. Peter Soyer

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

240 Scopus citations


Prior skin image datasets have not addressed patient-level information obtained from multiple skin lesions from the same patient. Though artificial intelligence classification algorithms have achieved expert-level performance in controlled studies examining single images, in practice dermatologists base their judgment holistically from multiple lesions on the same patient. The 2020 SIIM-ISIC Melanoma Classification challenge dataset described herein was constructed to address this discrepancy between prior challenges and clinical practice, providing for each image in the dataset an identifier allowing lesions from the same patient to be mapped to one another. This patient-level contextual information is frequently used by clinicians to diagnose melanoma and is especially useful in ruling out false positives in patients with many atypical nevi. The dataset represents 2,056 patients (20.8% with at least one melanoma, 79.2% with zero melanomas) from three continents with an average of 16 lesions per patient, consisting of 33,126 dermoscopic images and 584 (1.8%) histopathologically confirmed melanomas compared with benign melanoma mimickers.

Original languageEnglish
Article number34
JournalScientific data
Issue number1
StatePublished - Dec 2021
Externally publishedYes


FundersFunder number
NIH/NCIP30 CA008748
National Institutes of Health
National Cancer InstituteP30CA008748
Melanoma Research AllianceYoung Investigator Award 614197
National Health and Medical Research CouncilAPP 1099021
Australian Cancer Research FoundationAustralian Centre of Excellence in Melaa Imaging, Diags (ACEMID)
University of Queensland


    Dive into the research topics of 'A patient-centric dataset of images and metadata for identifying melanomas using clinical context'. Together they form a unique fingerprint.

    Cite this