An overview of voice conversion systems

Seyed Hamidreza Mohammadi, Alexander Kain

Research output: Contribution to journalReview articlepeer-review

204 Scopus citations

Abstract

Voice transformation (VT) aims to change one or more aspects of a speech signal while preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to change a source speaker's speech in such a way that the generated output is perceived as a sentence uttered by a target speaker. Despite many years of research, VC systems still exhibit deficiencies in accurately mimicking a target speaker spectrally and prosodically, and simultaneously maintaining high speech quality. In this work we provide an overview of real-world applications, extensively study existing systems proposed in the literature, and discuss remaining challenges.

Original languageEnglish (US)
Pages (from-to)65-82
Number of pages18
JournalSpeech Communication
Volume88
DOIs
StatePublished - Apr 1 2017

Keywords

  • Overview
  • Survey
  • Voice conversion

ASJC Scopus subject areas

  • Software
  • Modeling and Simulation
  • Communication
  • Language and Linguistics
  • Linguistics and Language
  • Computer Vision and Pattern Recognition
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'An overview of voice conversion systems'. Together they form a unique fingerprint.

Cite this