TY - GEN
T1 - Transmutative voice conversion
AU - Mohammadi, Seyed Hamidreza
AU - Kain, Alexander
PY - 2013/10/18
Y1 - 2013/10/18
N2 - There are two types of voice conversion (VC) systems: generative and transmutative. A generative VC system typically uses a compact parametrization of speech and maps input to output parameters directly; however, the relative low dimensionality of the underlying speech model reduces quality. On the other hand, a transmutative VC system modifies high-dimensional features of a high-fidelity speech model, leaving critical details unmodified. Two versions of transmutative VC approach are implemented and compared to a generative VC approach. The results show that the implemented transmutative VC is significantly better compared to generative VC in terms of quality. The difference between the two VC methods regarding recognition scores are insignificant.
AB - There are two types of voice conversion (VC) systems: generative and transmutative. A generative VC system typically uses a compact parametrization of speech and maps input to output parameters directly; however, the relative low dimensionality of the underlying speech model reduces quality. On the other hand, a transmutative VC system modifies high-dimensional features of a high-fidelity speech model, leaving critical details unmodified. Two versions of transmutative VC approach are implemented and compared to a generative VC approach. The results show that the implemented transmutative VC is significantly better compared to generative VC in terms of quality. The difference between the two VC methods regarding recognition scores are insignificant.
KW - frequency warping
KW - speech transformation
KW - voice conversion
UR - http://www.scopus.com/inward/record.url?scp=84890475857&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84890475857&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2013.6639003
DO - 10.1109/ICASSP.2013.6639003
M3 - Conference contribution
AN - SCOPUS:84890475857
SN - 9781479903566
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 6920
EP - 6924
BT - 2013 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013 - Proceedings
T2 - 2013 38th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2013
Y2 - 26 May 2013 through 31 May 2013
ER -