Combining standard and throat microphones for robust speech recognition

Martin Graciarena; Horacio Franco; Kemal Sonmez; Harry Bratt

doi:10.1109/LSP.2003.808549

Combining standard and throat microphones for robust speech recognition

Martin Graciarena, Horacio Franco, Kemal Sonmez, Harry Bratt

Research output: Contribution to journal › Article › peer-review

94 Scopus citations

Abstract

We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed "stereo" database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both using artificially added noise and using recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.

Original language	English (US)
Pages (from-to)	72-74
Number of pages	3
Journal	IEEE Signal Processing Letters
Volume	10
Issue number	3
DOIs	https://doi.org/10.1109/LSP.2003.808549
State	Published - Mar 2003
Externally published	Yes

Keywords

Noise robustness
Probabilistic optimum filtering
Speech recognition
Throat microphone

ASJC Scopus subject areas

Signal Processing
Applied Mathematics
Electrical and Electronic Engineering

Access to Document

10.1109/LSP.2003.808549

Cite this

@article{6c596fe8fa794bd190e92b3ae785ac72,

title = "Combining standard and throat microphones for robust speech recognition",

abstract = "We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed {"}stereo{"} database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both using artificially added noise and using recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.",

keywords = "Noise robustness, Probabilistic optimum filtering, Speech recognition, Throat microphone",

author = "Martin Graciarena and Horacio Franco and Kemal Sonmez and Harry Bratt",

note = "Funding Information: Manuscript received March 25, 2002; revised August 12, 2002. This work was supported in part by the Defense Advanced Research Projects Agency through the ROAR project under Contract N66001-99-D-8504 and in part by internal funding from SRI International. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Steven L. Gay.",

year = "2003",

month = mar,

doi = "10.1109/LSP.2003.808549",

language = "English (US)",

volume = "10",

pages = "72--74",

journal = "IEEE Signal Processing Letters",

issn = "1070-9908",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

number = "3",

}

TY - JOUR

T1 - Combining standard and throat microphones for robust speech recognition

AU - Graciarena, Martin

AU - Franco, Horacio

AU - Sonmez, Kemal

AU - Bratt, Harry

N1 - Funding Information: Manuscript received March 25, 2002; revised August 12, 2002. This work was supported in part by the Defense Advanced Research Projects Agency through the ROAR project under Contract N66001-99-D-8504 and in part by internal funding from SRI International. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Steven L. Gay.

PY - 2003/3

Y1 - 2003/3

N2 - We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed "stereo" database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both using artificially added noise and using recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.

AB - We present a method to combine the standard and throat microphone signals for robust speech recognition in noisy environments. Our approach is to use the probabilistic optimum filter (POF) mapping algorithm to estimate the standard microphone clean-speech feature vectors, used by standard speech recognizers, from both microphones' noisy-speech feature vectors. A small untranscribed "stereo" database (noisy and clean simultaneous recordings) is required to train the POF mappings. In continuous-speech recognition experiments using SRI International's DECIPHER recognition system, both using artificially added noise and using recorded noisy speech, the combined-microphone approach significantly outperforms the single-microphone approach.

KW - Noise robustness

KW - Probabilistic optimum filtering

KW - Speech recognition

KW - Throat microphone

UR - http://www.scopus.com/inward/record.url?scp=0037341762&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0037341762&partnerID=8YFLogxK

U2 - 10.1109/LSP.2003.808549

DO - 10.1109/LSP.2003.808549

M3 - Article

AN - SCOPUS:0037341762

SN - 1070-9908

VL - 10

SP - 72

EP - 74

JO - IEEE Signal Processing Letters

JF - IEEE Signal Processing Letters

IS - 3

ER -

Combining standard and throat microphones for robust speech recognition

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this