TY - GEN
T1 - Stochastic modeling of spectral adjustment for high quality pitch modification
AU - Kain, Alexander
AU - Stylianou, Yannis
N1 - Publisher Copyright:
© 2000 IEEE.
PY - 2000
Y1 - 2000
N2 - We present a new algorithm for adjusting the magnitude spectrum when the fundamental frequency (F0) of a speech signal is altered. The algorithm exploits the correlation between F0 and the magnitude spectrum of speech as represented by line spectral frequencies (LSFs). This correlation is class-dependent, and thus a broad classification of the input is achieved by a Gaussian mixture model (GMM). The within-class dependencies of LSFs on F0 values are captured by constructing their joint probability densities using a series of GMMs, one for each speech class. The proposed system is used for post-processing the pitch modified signal. Perceptual tests showed that the addition of this post-processing system improves the naturalness of the pitch modified signal for large pitch modification factors.
AB - We present a new algorithm for adjusting the magnitude spectrum when the fundamental frequency (F0) of a speech signal is altered. The algorithm exploits the correlation between F0 and the magnitude spectrum of speech as represented by line spectral frequencies (LSFs). This correlation is class-dependent, and thus a broad classification of the input is achieved by a Gaussian mixture model (GMM). The within-class dependencies of LSFs on F0 values are captured by constructing their joint probability densities using a series of GMMs, one for each speech class. The proposed system is used for post-processing the pitch modified signal. Perceptual tests showed that the addition of this post-processing system improves the naturalness of the pitch modified signal for large pitch modification factors.
UR - http://www.scopus.com/inward/record.url?scp=0033693289&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0033693289&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2000.859118
DO - 10.1109/ICASSP.2000.859118
M3 - Conference contribution
AN - SCOPUS:0033693289
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 949
EP - 952
BT - Signal Processing Theory and Methods IIAudio and ElectroacusticsSpeech Processing I
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 25th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2000
Y2 - 5 June 2000 through 9 June 2000
ER -