TY - JOUR
T1 - Results of the 2016 International Skin Imaging Collaboration International Symposium on Biomedical Imaging challenge
T2 - Comparison of the accuracy of computer algorithms to dermatologists for the diagnosis of melanoma from dermoscopic images
AU - International Skin Imaging Collaboration
AU - Marchetti, Michael A.
AU - Codella, Noel C.F.
AU - Dusza, Stephen W.
AU - Gutman, David A.
AU - Helba, Brian
AU - Kalloo, Aadi
AU - Mishra, Nabin
AU - Carrera, Cristina
AU - Celebi, M. Emre
AU - DeFazio, Jennifer L.
AU - Jaimes, Natalia
AU - Marghoob, Ashfaq A.
AU - Quigley, Elizabeth
AU - Scope, Alon
AU - Yélamos, Oriol
AU - Halpern, Allan C.
N1 - Publisher Copyright:
© 2017 American Academy of Dermatology, Inc.
PY - 2018/2
Y1 - 2018/2
N2 - Background: Computer vision may aid in melanoma detection. Objective: We sought to compare melanoma diagnostic accuracy of computer algorithms to dermatologists using dermoscopic images. Methods: We conducted a cross-sectional study using 100 randomly selected dermoscopic images (50 melanomas, 44 nevi, and 6 lentigines) from an international computer vision melanoma challenge dataset (n = 379), along with individual algorithm results from 25 teams. We used 5 methods (nonlearned and machine learning) to combine individual automated predictions into “fusion” algorithms. In a companion study, 8 dermatologists classified the lesions in the 100 images as either benign or malignant. Results: The average sensitivity and specificity of dermatologists in classification was 82% and 59%. At 82% sensitivity, dermatologist specificity was similar to the top challenge algorithm (59% vs. 62%, P =.68) but lower than the best-performing fusion algorithm (59% vs. 76%, P =.02). Receiver operating characteristic area of the top fusion algorithm was greater than the mean receiver operating characteristic area of dermatologists (0.86 vs. 0.71, P =.001). Limitations: The dataset lacked the full spectrum of skin lesions encountered in clinical practice, particularly banal lesions. Readers and algorithms were not provided clinical data (eg, age or lesion history/symptoms). Results obtained using our study design cannot be extrapolated to clinical practice. Conclusion: Deep learning computer vision systems classified melanoma dermoscopy images with accuracy that exceeded some but not all dermatologists.
AB - Background: Computer vision may aid in melanoma detection. Objective: We sought to compare melanoma diagnostic accuracy of computer algorithms to dermatologists using dermoscopic images. Methods: We conducted a cross-sectional study using 100 randomly selected dermoscopic images (50 melanomas, 44 nevi, and 6 lentigines) from an international computer vision melanoma challenge dataset (n = 379), along with individual algorithm results from 25 teams. We used 5 methods (nonlearned and machine learning) to combine individual automated predictions into “fusion” algorithms. In a companion study, 8 dermatologists classified the lesions in the 100 images as either benign or malignant. Results: The average sensitivity and specificity of dermatologists in classification was 82% and 59%. At 82% sensitivity, dermatologist specificity was similar to the top challenge algorithm (59% vs. 62%, P =.68) but lower than the best-performing fusion algorithm (59% vs. 76%, P =.02). Receiver operating characteristic area of the top fusion algorithm was greater than the mean receiver operating characteristic area of dermatologists (0.86 vs. 0.71, P =.001). Limitations: The dataset lacked the full spectrum of skin lesions encountered in clinical practice, particularly banal lesions. Readers and algorithms were not provided clinical data (eg, age or lesion history/symptoms). Results obtained using our study design cannot be extrapolated to clinical practice. Conclusion: Deep learning computer vision systems classified melanoma dermoscopy images with accuracy that exceeded some but not all dermatologists.
KW - International Skin Imaging Collaboration
KW - International Symposium on Biomedical Imaging
KW - computer algorithm
KW - computer vision
KW - dermatologist
KW - machine learning
KW - melanoma
KW - reader study
KW - skin cancer
UR - http://www.scopus.com/inward/record.url?scp=85030173579&partnerID=8YFLogxK
U2 - 10.1016/j.jaad.2017.08.016
DO - 10.1016/j.jaad.2017.08.016
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 28969863
AN - SCOPUS:85030173579
SN - 0190-9622
VL - 78
SP - 270-277.e1
JO - Journal of the American Academy of Dermatology
JF - Journal of the American Academy of Dermatology
IS - 2
ER -