Automated retrieval of CT images of liver lesions on the basis of image similarity: Method and preliminary results

Sandy A. Napel, Christopher F. Beaulieu, Cesar Rodriguez, Jingyu Cui, Jiajing Xu, Ankit Gupta, Daniel Korenblum, Hayit Greenspan, Yongjun Ma, Daniel L. Rubin

Research output: Contribution to journalArticlepeer-review

93 Scopus citations


Purpose: To develop a system to facilitate the retrieval of radiologic images that contain similar-appearing lesions and to perform a preliminary evaluation of this system with a database of computed tomographic (CT) images of the liver and an external standard of image similarity. Materials and Methods: Institutional review board approval was obtained for retrospective analysis of deidentified patient images. Thereafter, 30 portal venous phase CT images of the liver exhibiting one of three types of liver lesions (13 cysts, seven hemangiomas, 10 metastases) were selected. A radiologist used a controlled lexicon and a tool developed for complete and standardized description of lesions to identify and annotate each lesion with semantic features. In addition, this software automatically computed image features on the basis of image texture and boundary sharpness. Semantic and computer-generated features were weighted and combined into a feature vector representing each image. An independent reference standard was created for pairwise image similarity. This was used in a leaveone-out cross-validation to train weights that optimized the rankings of images in the database in terms of similarity to query images. Performance was evaluated by using precisionrecall curves and normalized discounted cumulative gain (NDCG), a common measure for the usefulness of information retrieval. Results: When used individually, groups of semantic, texture, and boundary features resulted in various levels of performance in retrieving relevant lesions. However, combining all features produced the best overall results. Mean precision was greater than 90% at all values of recall, and mean, best, and worst case retrieval accuracy was greater than 95%, 100%, and greater than 78%, respectively, with NDCG. Conclusion: Preliminary assessment of this approach shows excellent retrieval results for three types of liver lesions visible on portal venous CT images, warranting continued development and validation in a larger and more comprehensive database.

Original languageEnglish
Pages (from-to)243-252
Number of pages10
Issue number1
StatePublished - Jul 2010


FundersFunder number
National Cancer InstituteR01CA072023


    Dive into the research topics of 'Automated retrieval of CT images of liver lesions on the basis of image similarity: Method and preliminary results'. Together they form a unique fingerprint.

    Cite this