Automatic indexing of journal abstracts with latent semantic analysis

Joel Robert Adams, Steven Bedrick

Research output: Chapter in Book/Report/Conference proceedingConference contribution


The BioASQ “Task on Large-Scale Online Biomedical Semantic Indexing” charges participants with assigning semantic tags to biomedical journal abstracts. We present a system that takes as input a biomedical abstract and uses latent semantic analysis to identify similar documents in the MEDLINE database. The system then uses a novel ranking scheme to select a list of MeSH tags from candidates drawn from the most similar documents. Our approach achieved better than baseline performance in both precision and recall. We suggest several possible strategies to improve the system’s performance.

Original languageEnglish (US)
Title of host publicationExperimental IR Meets Multilinguality, Multimodality, and Interaction - 6th International Conference of the CLEF Association, CLEF 2015, Proceedings
EditorsEric San Juan, Jacques Savoy, Josiane Mothe, Jaap Kamps, Gareth J.F. Jones, Nicola Ferro, Karen Pinel-Sauvagnat, Linda Cappellato
Number of pages9
ISBN (Print)9783319240268
StatePublished - 2015
Event6th International Conference on Labs of the Evaluation Forum, CLEF 2015 - Toulouse, France
Duration: Sep 8 2015Sep 11 2015

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349


Other6th International Conference on Labs of the Evaluation Forum, CLEF 2015

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science


Dive into the research topics of 'Automatic indexing of journal abstracts with latent semantic analysis'. Together they form a unique fingerprint.

Cite this