RadImageNet: An Open Radiologic Deep Learning Research Dataset for Effective Transfer Learning

Xueyan Mei, Zelong Liu, Philip M. Robson, Brett Marinelli, Mingqian Huang, Amish Doshi, Adam Jacobi, Chendi Cao, Katherine E. Link, Thomas Yang, Ying Wang, Hayit Greenspan, Timothy Deyer, Zahi A. Fayad, Yang Yang*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Purpose: To demonstrate the value of pretraining with millions of radiologic images compared with ImageNet photographic images on downstream medical applications when using transfer learning. Materials and Methods: This retrospective study included patients who underwent a radiologic study between 2005 and 2020 at an out-patient imaging facility. Key images and associated labels from the studies were retrospectively extracted from the original study inter-pretation. These images were used for RadImageNet model training with random weight initiation. The RadImageNet models were compared with ImageNet models using the area under the receiver operating characteristic curve (AUC) for eight classification tasks and using Dice scores for two segmentation problems. Results: The RadImageNet database consists of 1.35 million annotated medical images in 131 872 patients who underwent CT, MRI, and US for musculoskeletal, neurologic, oncologic, gastrointestinal, endocrine, abdominal, and pulmonary pathologic conditions. For transfer learning tasks on small datasets—thyroid nodules (US), breast masses (US), anterior cruciate ligament injuries (MRI), and meniscal tears (MRI)—the RadImageNet models demonstrated a significant advantage (P < .001) to ImageNet models (9.4%, 4.0%, 4.8%, and 4.5% AUC improvements, respectively). For larger datasets—pneumonia (chest radiography), COVID-19 (CT), SARS-CoV-2 (CT), and intracranial hemorrhage (CT)—the RadImageNet models also illustrated improved AUC (P < .001) by 1.9%, 6.1%, 1.7%, and 0.9%, respectively. Additionally, lesion localizations of the RadImageNet models were improved by 64.6% and 16.4% on thyroid and breast US datasets, respectively. Conclusion: RadImageNet pretrained models demonstrated better interpretability compared with ImageNet models, especially for smaller radiologic datasets.

Original languageEnglish
Article numbere210315
JournalRadiology: Artificial Intelligence
Volume4
Issue number5
DOIs
StatePublished - Sep 2022
Externally publishedYes

Keywords

  • Brain/Brain Stem
  • CT
  • Computer Applications–General (Infor-matics)
  • Evidence-based Medicine
  • Head/Neck
  • MR Imaging
  • Thorax
  • US

Fingerprint

Dive into the research topics of 'RadImageNet: An Open Radiologic Deep Learning Research Dataset for Effective Transfer Learning'. Together they form a unique fingerprint.

Cite this