Analyzing web log files of the health on the net honmedia search engine to define typical image search tasks for image retrieval evaluation

Henning Müller; Célia Boyer; Arnaud Gaudinat; William Hersh; Antoine Geissbuhler

Medical Informatics and Clinical Epidemiology

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

10 Scopus citations

Abstract

Medical institutions produce an ever-increasing amount of diverse information. Its digital form makes these data available for use beyond a single patient. Images are no exception. However, little is known about how medical professionals search for visual medical information and how they want to use it outside the context of a single patient. This article analyzes ten months of usage log files of the Health on the Net (HON) medical media search engine. Keywords were extracted from all queries, and the most frequent terms and subjects were identified. The dataset required substantial preprocessing: problems included national character sets, spelling errors, and the use of terms in several languages. The results show that media search, particularly for images, was frequently used. The most common queries were for general concepts (e.g., heart, lung). To define realistic information needs for the ImageCLEFmed challenge evaluation (Cross Language Evaluation Forum medical image retrieval), we used frequent queries that were still specific enough to cover at least two of the three axes: modality, anatomic region, and pathology. Several research groups evaluated their image retrieval algorithms on these defined topics.
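The kind of analysis the abstract describes (normalizing queries, counting frequent terms, and keeping frequent queries that cover at least two of the three axes) might be sketched as follows. This is a minimal illustration and not the authors' pipeline: the axis vocabularies in `AXES`, the function names, and the `min_count` threshold are all assumptions made for the example, and the sketch does not attempt spelling correction or translation between languages.

```python
import unicodedata
from collections import Counter

# Hypothetical axis vocabularies; the term lists actually used are not given
# in the abstract.
AXES = {
    "modality": {"mri", "ct", "ultrasound", "radiograph"},
    "anatomy": {"heart", "lung", "knee", "brain"},
    "pathology": {"fracture", "tumor", "pneumonia"},
}


def normalize(query):
    """Lowercase a query, strip accents, and return its alphabetic tokens."""
    decomposed = unicodedata.normalize("NFKD", query.lower())
    unaccented = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    tokens = (tok.strip(".,;:!?\"'()") for tok in unaccented.split())
    return [tok for tok in tokens if tok.isalpha()]


def term_frequencies(queries):
    """Count how often each normalized term occurs across all queries."""
    counts = Counter()
    for query in queries:
        counts.update(normalize(query))
    return counts


def axes_covered(query):
    """Return the names of the axes that have at least one term in the query."""
    tokens = set(normalize(query))
    return {axis for axis, vocab in AXES.items() if tokens & vocab}


def candidate_topics(queries, min_count=2):
    """Keep queries seen at least min_count times that cover two or more axes."""
    counts = Counter(" ".join(normalize(q)) for q in queries)
    return [
        q for q, n in counts.most_common()
        if n >= min_count and len(axes_covered(q)) >= 2
    ]
```

With this sketch, a frequent but general query such as "heart" is counted in the term frequencies but is not returned by `candidate_topics`, because it covers only the anatomy axis.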

Original language	English (US)
Title of host publication	MEDINFO 2007 - Proceedings of the 12th World Congress on Health (Medical) Informatics
Subtitle of host publication	Building Sustainable Health Systems
Publisher	IOS Press
Pages	1319-1323
Number of pages	5
ISBN (Print)	9781586037741
State	Published - 2007
Event	12th World Congress on Medical Informatics, MEDINFO 2007 - Brisbane, QLD, Australia (Aug 20 2007 → Aug 24 2007)

Publication series

Name	Studies in Health Technology and Informatics
Volume	129
ISSN (Print)	0926-9630
ISSN (Electronic)	1879-8365

Keywords

image retrieval evaluation
log files analysis

ASJC Scopus subject areas

Biomedical Engineering
Health Informatics
Health Information Management

Cite this

Müller, H., Boyer, C., Gaudinat, A., Hersh, W., & Geissbuhler, A. (2007). Analyzing web log files of the health on the net honmedia search engine to define typical image search tasks for image retrieval evaluation. In MEDINFO 2007 - Proceedings of the 12th World Congress on Health (Medical) Informatics: Building Sustainable Health Systems (pp. 1319-1323). (Studies in Health Technology and Informatics; Vol. 129). IOS Press.

@inproceedings{3d355abd60bd43a8bcee044fbb0b5f71,

title = "Analyzing web log files of the health on the net honmedia search engine to define typical image search tasks for image retrieval evaluation",

keywords = "image retrieval evaluation, log files analysis",

author = "Henning M{\"u}ller and C{\'e}lia Boyer and Arnaud Gaudinat and William Hersh and Antoine Geissbuhler",

year = "2007",

language = "English (US)",

isbn = "9781586037741",

series = "Studies in Health Technology and Informatics",

publisher = "IOS Press",

pages = "1319--1323",

booktitle = "MEDINFO 2007 - Proceedings of the 12th World Congress on Health (Medical) Informatics",

note = "12th World Congress on Medical Informatics, MEDINFO 2007 ; Conference date: 20-08-2007 Through 24-08-2007",

}

TY - GEN

T1 - Analyzing web log files of the health on the net honmedia search engine to define typical image search tasks for image retrieval evaluation

AU - Müller, Henning

AU - Boyer, Célia

AU - Gaudinat, Arnaud

AU - Hersh, William

AU - Geissbuhler, Antoine

PY - 2007

Y1 - 2007

KW - image retrieval evaluation

KW - log files analysis

UR - http://www.scopus.com/inward/record.url?scp=35748980893&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=35748980893&partnerID=8YFLogxK

M3 - Conference contribution

C2 - 17911928

AN - SCOPUS:35748980893

SN - 9781586037741

T3 - Studies in Health Technology and Informatics

SP - 1319

EP - 1323

BT - MEDINFO 2007 - Proceedings of the 12th World Congress on Health (Medical) Informatics

PB - IOS Press

T2 - 12th World Congress on Medical Informatics, MEDINFO 2007

Y2 - 20 August 2007 through 24 August 2007

ER -
