An anticorrelation kernel for improved system combination in speaker verification

Luciana Ferrer; Kemal Sönmez; Elizabeth Shriberg

Institute on Development and Disability

Research output: Contribution to conference › Paper › peer-review

3 Scopus citations

Abstract

This paper presents a method for training SVM-based classification systems for combination with other existing classification systems designed for the same task. Ideally, a new system should be designed such that, when combined with the existing systems, the resulting performance is optimized. To achieve this goal, we include a regularization term in the SVM objective function that aims to reduce the within-class correlation between the resulting scores and the scores produced by one of the existing systems, introducing a trade-off between such correlation and the system’s individual performance. That is, the SVM system “takes one for the team”, falling somewhat short of its best possible performance in order to be more complementary to the existing system. We report results on the NIST 2005 and 2006 speaker recognition evaluations (SRE) using three component systems: a standard UBM-GMM system, an MLLR-based system, and a prosodic system, and show that the proposed technique results in performance gains of 16% in EER and 23% in DCF.
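The idea of trading individual performance against correlation with an existing system can be illustrated with a small sketch. This is not the paper's formulation: it assumes a linear SVM in primal form, labels in {-1, +1}, and uses the squared within-class covariance (rather than correlation) between the new scores and the existing system's scores as the penalty. The function names, the weight `lam`, and the array `s` of existing-system scores are all hypothetical.

```python
import numpy as np

def within_class_cov_vectors(X, s, y):
    """Return one vector v_c per class such that w @ v_c is the within-class
    covariance between the new scores (X @ w + b) and the existing scores s."""
    vs = []
    for c in np.unique(y):
        mask = (y == c)
        Xc = X[mask] - X[mask].mean(axis=0)   # center features within the class
        sc = s[mask] - s[mask].mean()         # center existing scores within the class
        vs.append(Xc.T @ sc / mask.sum())
    return np.stack(vs)                       # shape: (n_classes, n_features)

def penalized_objective(w, b, X, y, V, C=1.0, lam=1.0):
    """Linear SVM primal objective plus an assumed anticorrelation-style penalty."""
    margins = y * (X @ w + b)
    hinge = np.maximum(0.0, 1.0 - margins).sum()   # standard hinge loss
    anticorr = np.sum((V @ w) ** 2)                # squared within-class covariances
    return 0.5 * w @ w + C * hinge + lam * anticorr
```

Under these assumptions the regularizer is quadratic in `w`, namely `0.5 * w.T @ (I + 2*lam*V.T @ V) @ w`, so the penalty can be absorbed into a modified inner product. That may be the sense in which the title speaks of a kernel, but the paper itself should be consulted for the actual construction. Setting `lam=0` recovers the ordinary SVM objective, and larger values give up more individual accuracy in exchange for scores that are less redundant with the existing system.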

Original language	English (US)
State	Published - Jan 1 2008
Event	Speaker and Language Recognition Workshop, Odyssey 2008 - Stellenbosch, South Africa
Duration	Jan 21 2008 → Jan 24 2008

Conference

Conference	Speaker and Language Recognition Workshop, Odyssey 2008
Country/Territory	South Africa
City	Stellenbosch
Period	1/21/08 → 1/24/08

ASJC Scopus subject areas

Human-Computer Interaction
Software
Signal Processing

Cite this

@conference{76b2b04ea5ca4abba527596a240a1705,

title = "An anticorrelation kernel for improved system combination in speaker verification",

abstract = "This paper presents a method for training SVM-based classification systems for combination with other existing classification systems designed for the same task. Ideally, a new system should be designed such that, when combined with the existing systems, the resulting performance is optimized. To achieve this goal, we include a regularization term in the SVM objective function that aims to reduce the within-class correlation between the resulting scores and the scores produced by one of the existing systems, introducing a trade-off between such correlation and the system{\textquoteright}s individual performance. That is, the SVM system “takes one for the team”, falling somewhat short of its best possible performance in order to be more complementary to the existing system. We report results on the NIST 2005 and 2006 speaker recognition evaluations (SRE) using three component systems: a standard UBM-GMM system, an MLLR-based system, and a prosodic system, and show that the proposed technique results in performance gains of 16\% in EER and 23\% in DCF.",

author = "Luciana Ferrer and Kemal S{\"o}nmez and Elizabeth Shriberg",

year = "2008",

month = jan,

day = "1",

language = "English (US)",

note = "Speaker and Language Recognition Workshop, Odyssey 2008 ; Conference date: 21-01-2008 Through 24-01-2008",

}

TY - CONF

T1 - An anticorrelation kernel for improved system combination in speaker verification

AU - Ferrer, Luciana

AU - Sönmez, Kemal

AU - Shriberg, Elizabeth

PY - 2008/1/1

Y1 - 2008/1/1

N2 - This paper presents a method for training SVM-based classification systems for combination with other existing classification systems designed for the same task. Ideally, a new system should be designed such that, when combined with the existing systems, the resulting performance is optimized. To achieve this goal, we include a regularization term in the SVM objective function that aims to reduce the within-class correlation between the resulting scores and the scores produced by one of the existing systems, introducing a trade-off between such correlation and the system’s individual performance. That is, the SVM system “takes one for the team”, falling somewhat short of its best possible performance in order to be more complementary to the existing system. We report results on the NIST 2005 and 2006 speaker recognition evaluations (SRE) using three component systems: a standard UBM-GMM system, an MLLR-based system, and a prosodic system, and show that the proposed technique results in performance gains of 16% in EER and 23% in DCF.

UR - http://www.scopus.com/inward/record.url?scp=85073163671&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85073163671&partnerID=8YFLogxK

M3 - Paper

T2 - Speaker and Language Recognition Workshop, Odyssey 2008

Y2 - 21 January 2008 through 24 January 2008

ER -
