TY - GEN
T1 - Multimodal medical image retrieval
T2 - 2010 ACM SIGMM International Conference on Multimedia Information Retrieval, MIR 2010
AU - Kalpathy-Cramer, Jayashree
AU - Hersh, William
PY - 2010
Y1 - 2010
N2 - Effective medical image retrieval can be useful in the clinical care of patients, education and research. Traditionally, image retrieval systems have been text-based, relying on the annotations or captions associated with the images. Although text-based information retrieval methods are mature and well researched, they are limited by the quality and availability of the annotations associated with the images. Advances in computer vision have led to methods for using the image itself as the search entity. However, the success of purely content-based techniques, when applied to a diverse set of clinical images, has been somewhat limited, and these systems have not had much success in the medical domain. On the other hand, as demonstrated in recent years, a combination of text-based and content-based image retrieval techniques can achieve improved retrieval performance if combined effectively. There are many approaches to multimodal retrieval, including early and late fusion of weighted results from the different search engines. In this work, we use automatic annotation based on visual attributes to label images as part of the indexing process, and subsequently use these labels to filter or reorder the results during the retrieval process. Labels for medical images can be categorized along three dimensions: imaging modality, anatomical location, and image finding or pathology. Our previous research has indicated that the imaging modality is most easily identified using visual techniques, whereas the caption or textual annotation frequently contains the finding or pathological information about the image. Thus, it is best to use visual methods to filter by modality and, occasionally, anatomy, while it is better to use the textual annotation to identify the finding of interest. 
We have created a modality classifier for the weakly labeled images in our collection using a novel approach that combines affinity propagation for the selection of class exemplars, textons and patch-based descriptors as visual features, and a Naive Bayes Nearest Neighbor technique for the classification of modality using visual features. We demonstrate a significant improvement in precision attained using this technique for the 2009 ImageCLEF medical retrieval task, using both our own textual runs and runs from all participants in 2009.
AB - Effective medical image retrieval can be useful in the clinical care of patients, education and research. Traditionally, image retrieval systems have been text-based, relying on the annotations or captions associated with the images. Although text-based information retrieval methods are mature and well researched, they are limited by the quality and availability of the annotations associated with the images. Advances in computer vision have led to methods for using the image itself as the search entity. However, the success of purely content-based techniques, when applied to a diverse set of clinical images, has been somewhat limited, and these systems have not had much success in the medical domain. On the other hand, as demonstrated in recent years, a combination of text-based and content-based image retrieval techniques can achieve improved retrieval performance if combined effectively. There are many approaches to multimodal retrieval, including early and late fusion of weighted results from the different search engines. In this work, we use automatic annotation based on visual attributes to label images as part of the indexing process, and subsequently use these labels to filter or reorder the results during the retrieval process. Labels for medical images can be categorized along three dimensions: imaging modality, anatomical location, and image finding or pathology. Our previous research has indicated that the imaging modality is most easily identified using visual techniques, whereas the caption or textual annotation frequently contains the finding or pathological information about the image. Thus, it is best to use visual methods to filter by modality and, occasionally, anatomy, while it is better to use the textual annotation to identify the finding of interest. 
We have created a modality classifier for the weakly labeled images in our collection using a novel approach that combines affinity propagation for the selection of class exemplars, textons and patch-based descriptors as visual features, and a Naive Bayes Nearest Neighbor technique for the classification of modality using visual features. We demonstrate a significant improvement in precision attained using this technique for the 2009 ImageCLEF medical retrieval task, using both our own textual runs and runs from all participants in 2009.
KW - Automatic annotation
KW - Classification
KW - Image retrieval
KW - Machine learning
UR - http://www.scopus.com/inward/record.url?scp=77952388718&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77952388718&partnerID=8YFLogxK
U2 - 10.1145/1743384.1743415
DO - 10.1145/1743384.1743415
M3 - Conference contribution
AN - SCOPUS:77952388718
SN - 9781605588155
T3 - MIR 2010 - Proceedings of the 2010 ACM SIGMM International Conference on Multimedia Information Retrieval
SP - 165
EP - 173
BT - MIR 2010 - Proceedings of the 2010 ACM SIGMM International Conference on Multimedia Information Retrieval
Y2 - 29 March 2010 through 31 March 2010
ER -