Abstract
Voice transformation (VT) aims to change one or more aspects of a speech signal while preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to change a source speaker's speech in such a way that the generated output is perceived as a sentence uttered by a target speaker. Despite many years of research, VC systems still exhibit deficiencies in accurately mimicking a target speaker spectrally and prosodically, and simultaneously maintaining high speech quality. In this work we provide an overview of real-world applications, extensively study existing systems proposed in the literature, and discuss remaining challenges.
Original language | English (US) |
---|---|
Pages (from-to) | 65-82 |
Number of pages | 18 |
Journal | Speech Communication |
Volume | 88 |
DOIs | |
State | Published - Apr 1 2017 |
Keywords
- Overview
- Survey
- Voice conversion
ASJC Scopus subject areas
- Software
- Modeling and Simulation
- Communication
- Language and Linguistics
- Linguistics and Language
- Computer Vision and Pattern Recognition
- Computer Science Applications