This paper tests speech recognition using prosody-dependent allophone models. The log likelihoods of various prosodically labeled phonemes are calculated using Baum-Welch re-estimation. These log likelihoods are then compared to the log likelihoods of non-prosodically labeled phonemes. Based on this comparison, it can be concluded that modeling all prosodic information directly in the vowel model leads to improvement in the model. Consonants, on the other hand, split naturally into three categories: strengthened, lengthened, and neutral.
CITATION STYLE
Borys, S. (2003). The importance of prosodic factors in phoneme modeling with applications to speech recognition. In Proceedings of the 2003 Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics - Student Research Workshop, HLT-NAACL 2003 (pp. 7–12). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1073416.1073418