Abstract
As an effort to make prosody useful in spontaneous speech recognition, we adopt a quasi-continuous prosodic annotation and accordingly design a prosody-dependent acoustic model to improve ASR performances. We propose a variable-parameter Hidden Markov Models, modeling the mean vector as a function of the prosody variable through a polynomial regression model. The prosodically-adapted acoustic models are used to re-score the N-best output from a standard ASR, according to the prosody variable assigned by an automatic prosody detector. Experiments on the Buckeye corpus demonstrate the effectiveness of our approach.
Author supplied keywords
Cite
CITATION STYLE
Huang, J. T., Huang, P. S., Mo, Y., Hasegawa-Johnson, M., & Cole, J. (2010). Prosody-dependent acoustic modeling using variable-parameter hidden markov models. In Proceedings of the International Conference on Speech Prosody. International Speech Communication Association. https://doi.org/10.21437/speechprosody.2010-101
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.