TY - GEN
T1 - Compression of acoustic inventories using asynchronous interpolation
AU - Kain, A. B.
AU - Van Santen, J. P.H.
N1 - Funding Information:
This research was conducted with support from NSF Grants 0117911 "Making Dysarthric Speech Intelligible" and 0082718 "Modeling Degree of Articulation for Speech Synthesis". We are grateful to Gilead Cohen and Johan Wouters for helpful discussions, and to Mike Macon for inspiring us to work on this topic.
Publisher Copyright:
© 2002 IEEE.
PY - 2002
Y1 - 2002
N2 - A compression method is proposed that takes advantage of a powerful property of acoustic unit inventories: In the appropriate acoustic space, units that share a (context-dependent or -independent) phoneme label must be close to a vector phoneme template associated with the phoneme. The method approximates units by interpolation between templates. The interpolation operation involves two asynchronous weight functions operating on the template. One is associated with spectral peak locations, the second with spectral balance. This enables approximating transitions such as [i:]→a[v], in which formant movement precedes frication onset. The algorithm guarantees smooth concatenation points.
AB - A compression method is proposed that takes advantage of a powerful property of acoustic unit inventories: In the appropriate acoustic space, units that share a (context-dependent or -independent) phoneme label must be close to a vector phoneme template associated with the phoneme. The method approximates units by interpolation between templates. The interpolation operation involves two asynchronous weight functions operating on the template. One is associated with spectral peak locations, the second with spectral balance. This enables approximating transitions such as [i:]→a[v], in which formant movement precedes frication onset. The algorithm guarantees smooth concatenation points.
UR - http://www.scopus.com/inward/record.url?scp=70349220348&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70349220348&partnerID=8YFLogxK
U2 - 10.1109/WSS.2002.1224378
DO - 10.1109/WSS.2002.1224378
M3 - Conference contribution
AN - SCOPUS:70349220348
T3 - Proceedings of 2002 IEEE Workshop on Speech Synthesis
SP - 83
EP - 86
BT - Proceedings of 2002 IEEE Workshop on Speech Synthesis
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2002 IEEE Workshop on Speech Synthesis
Y2 - 11 September 2002 through 13 September 2002
ER -