Abstract
When speech understanding systems are used in real applications, they encounter incidental noise generated by the speaker and the environment. Such noises can cause serious problems for speech recognizers that are not designed to cope with them. We attempt to model these noises by training HMM "noise words" to match classes of noises. The noise words were incorporated into the Sphinx system, and its performance was compared with that of the system without noise words. Initial results suggest that the technique significantly increases system performance.
Cite
Ward, W. (1989). Modelling Non-verbal Sounds for Speech Recognition. In Speech and Natural Language, Proceedings of a Workshop (pp. 47–50). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1075434.1075443