POS Tags and Decision Trees for Language Modeling

Peter A. Heeman

POS Tags and Decision Trees for Language Modeling

Peter A. Heeman

Institute on Development and Disability

Research output: Contribution to conference › Paper › peer-review

16 Scopus citations

Abstract

Language models for speech recognition concentrate solely on recognizing the words that were spoken. In this paper, we advocate redefining the speech recognition problem so that its goal is to find both the best sequence of words and their POS tags, and thus incorporate POS tagging. To use POS tags effectively, we use clustering and decision tree algorithms, which allow generalizations between POS tags and words to be effectively used in estimating the probability distributions. We show that our POS model gives a reduction in word error rate and perplexity for the Trains corpus in comparison to word and class-based approaches. By using the Wall Street Journal corpus, we show that this approach scales up when more training data is available.

Original language	English (US)
Pages	129-137
Number of pages	9
State	Published - 1999
Event	1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, EMNLP 1999 - College Park, United States Duration: Jun 21 1999 → Jun 22 1999

Conference

Conference	1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, EMNLP 1999
Country/Territory	United States
City	College Park
Period	6/21/99 → 6/22/99

ASJC Scopus subject areas

Computer Science Applications
Information Systems
Computational Theory and Mathematics

Cite this

@conference{11e2d8693ab04af9842ac1b4c658bdf6,

title = "POS Tags and Decision Trees for Language Modeling",

abstract = "Language models for speech recognition concentrate solely on recognizing the words that were spoken. In this paper, we advocate redefining the speech recognition problem so that its goal is to find both the best sequence of words and their POS tags, and thus incorporate POS tagging. To use POS tags effectively, we use clustering and decision tree algorithms, which allow generalizations between POS tags and words to be effectively used in estimating the probability distributions. We show that our POS model gives a reduction in word error rate and perplexity for the Trains corpus in comparison to word and class-based approaches. By using the Wall Street Journal corpus, we show that this approach scales up when more training data is available.",

author = "Heeman, {Peter A.}",

note = "Publisher Copyright: {\textcopyright} 1999 Proceedings of the 1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, EMNLP 1999. All rights reserved.; 1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, EMNLP 1999 ; Conference date: 21-06-1999 Through 22-06-1999",

year = "1999",

language = "English (US)",

pages = "129--137",

}

TY - CONF

T1 - POS Tags and Decision Trees for Language Modeling

AU - Heeman, Peter A.

PY - 1999

Y1 - 1999

N2 - Language models for speech recognition concentrate solely on recognizing the words that were spoken. In this paper, we advocate redefining the speech recognition problem so that its goal is to find both the best sequence of words and their POS tags, and thus incorporate POS tagging. To use POS tags effectively, we use clustering and decision tree algorithms, which allow generalizations between POS tags and words to be effectively used in estimating the probability distributions. We show that our POS model gives a reduction in word error rate and perplexity for the Trains corpus in comparison to word and class-based approaches. By using the Wall Street Journal corpus, we show that this approach scales up when more training data is available.

AB - Language models for speech recognition concentrate solely on recognizing the words that were spoken. In this paper, we advocate redefining the speech recognition problem so that its goal is to find both the best sequence of words and their POS tags, and thus incorporate POS tagging. To use POS tags effectively, we use clustering and decision tree algorithms, which allow generalizations between POS tags and words to be effectively used in estimating the probability distributions. We show that our POS model gives a reduction in word error rate and perplexity for the Trains corpus in comparison to word and class-based approaches. By using the Wall Street Journal corpus, we show that this approach scales up when more training data is available.

UR - http://www.scopus.com/inward/record.url?scp=0039623602&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0039623602&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:0039623602

SP - 129

EP - 137

T2 - 1999 Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora, EMNLP 1999

Y2 - 21 June 1999 through 22 June 1999

ER -

POS Tags and Decision Trees for Language Modeling

Abstract

Conference

ASJC Scopus subject areas

Other files and links

Fingerprint

Cite this