An anticorrelation kernel for improved system combination in speaker verification

Luciana Ferrer, Kemal Sönmez, Elizabeth Shriberg

Research output: Contribution to conference › Paper › peer-review

3 Scopus citations

Abstract

This paper presents a method for training SVM-based classification systems for combination with other existing classification systems designed for the same task. Ideally, a new system should be designed such that, when combined with the existing systems, the resulting performance is optimized. To achieve this goal, we include a regularization term in the SVM objective function that aims to reduce the within-class correlation between the resulting scores and the scores produced by one of the existing systems, introducing a trade-off between such correlation and the system’s individual performance. That is, the SVM system “takes one for the team”, falling somewhat short of its best possible performance in order to be more complementary to the existing system. We report results on the NIST 2005 and 2006 speaker recognition evaluations (SRE) using three component systems: a standard UBM-GMM system, an MLLR-based system, and a prosodic system, and show that the proposed technique results in performance gains of 16% in EER and 23% in DCF.
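The trade-off described above can be illustrated with a small sketch. The abstract does not give the exact form of the regularizer, so everything here is an assumption made for illustration: a linear SVM trained by plain subgradient descent, a penalty equal to the sum over classes of the squared within-class covariance between the new scores and the existing system's scores, and synthetic data in which the first feature stands in for the existing system's score. The names `within_class_cov`, `anticorr_penalty`, and `train` are hypothetical, and this is not the kernel formulation used in the paper.

```python
import numpy as np

def within_class_cov(f, s, y):
    """Covariance between new scores f and existing scores s, per class label."""
    covs = {}
    for c in np.unique(y):
        m = y == c
        covs[c] = np.mean((f[m] - f[m].mean()) * (s[m] - s[m].mean()))
    return covs

def anticorr_penalty(f, s, y):
    """Assumed regularizer: sum over classes of squared within-class covariance."""
    return sum(v ** 2 for v in within_class_cov(f, s, y).values())

def train(X, y, s, lam, C=1.0, lr=0.01, n_iter=500):
    """Subgradient descent on
    0.5*||w||^2 + C*mean(hinge loss) + lam*anticorr_penalty (assumed objective)."""
    n, d = X.shape
    w = np.zeros(d)
    b = 0.0
    for _ in range(n_iter):
        f = X @ w + b
        viol = y * f < 1  # samples inside the margin or misclassified
        grad_w = w - C * (X[viol].T @ y[viol]) / n
        grad_b = -C * y[viol].sum() / n
        for c in np.unique(y):
            m = y == c
            s_c = s[m] - s[m].mean()
            cov = np.mean((f[m] - f[m].mean()) * s_c)
            # d(cov)/dw = X_c^T s_c / n_c; the bias b does not affect the covariance
            grad_w += 2 * lam * cov * (X[m].T @ s_c) / m.sum()
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b

# Synthetic two-class data (assumption): both features are discriminative,
# and the existing system's score is taken to be the first feature.
rng = np.random.default_rng(0)
y = np.repeat([1.0, -1.0], 200)
X = rng.normal(size=(400, 2)) + y[:, None]
s = X[:, 0]

w_plain, b_plain = train(X, y, s, lam=0.0)
w_reg, b_reg = train(X, y, s, lam=5.0)
pen_plain = anticorr_penalty(X @ w_plain + b_plain, s, y)
pen_reg = anticorr_penalty(X @ w_reg + b_reg, s, y)
print("within-class penalty, lam=0:", pen_plain)
print("within-class penalty, lam=5:", pen_reg)
```

In this toy setup, raising `lam` pushes the weight on the feature shared with the existing system toward zero, so the new scores rely more on the complementary feature at some cost to stand-alone accuracy.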

Original language: English (US)
State: Published - Jan 1 2008
Event: Speaker and Language Recognition Workshop, Odyssey 2008 - Stellenbosch, South Africa
Duration: Jan 21 2008 → Jan 24 2008

Conference

Conference: Speaker and Language Recognition Workshop, Odyssey 2008
Country/Territory: South Africa
City: Stellenbosch
Period: 1/21/08 → 1/24/08

ASJC Scopus subject areas

  • Human-Computer Interaction
  • Software
  • Signal Processing
