MedICaT: A dataset of medical images, captions, and textual references

Sanjay Subramanian, Lucy Lu Wang, Sachin Mehta, Ben Bogin, Madeleine van Zuylen, Sravanthi Parasa, Sameer Singh, Matt Gardner, Hannaneh Hajishirzi

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    Abstract

    Understanding the relationship between figures and text is key to scientific document understanding. Medical figures in particular are quite complex, often consisting of several subfigures (75% of figures in our dataset), with detailed text describing their content. Previous work studying figures in scientific papers focused on classifying figure content rather than understanding how images relate to the text. To address challenges in figure retrieval and figure-to-text alignment, we introduce MEDICAT, a dataset of medical images in context. MEDICAT consists of 217K images from 131K open access biomedical papers, and includes captions, inline references for 74% of figures, and manually annotated subfigures and subcaptions for a subset of figures. Using MEDICAT, we introduce the task of subfigure to subcaption alignment in compound figures and demonstrate the utility of inline references in image-text matching. Our data and code can be accessed at https://github.com/allenai/medicat.

    Original languageEnglish
    Title of host publicationFindings of the Association for Computational Linguistics Findings of ACL
    Subtitle of host publicationEMNLP 2020
    PublisherAssociation for Computational Linguistics (ACL)
    Pages2112-2120
    Number of pages9
    ISBN (Electronic)9781952148903
    StatePublished - 2020
    EventFindings of the Association for Computational Linguistics, ACL 2020: EMNLP 2020 - Virtual, Online
    Duration: 16 Nov 202020 Nov 2020

    Publication series

    NameFindings of the Association for Computational Linguistics Findings of ACL: EMNLP 2020

    Conference

    ConferenceFindings of the Association for Computational Linguistics, ACL 2020: EMNLP 2020
    CityVirtual, Online
    Period16/11/2020/11/20

    Fingerprint

    Dive into the research topics of 'MedICaT: A dataset of medical images, captions, and textual references'. Together they form a unique fingerprint.

    Cite this