Abstract
Correct stress placement is important in text-to-speech systems, in terms of both the overall accuracy and the naturalness of pronunciation. In this paper, we formulate stress assignment as a sequence prediction problem. We represent words as sequences of substrings, and use the substrings as features in a Support Vector Machine (SVM) ranker, which is trained to rank possible stress patterns. The ranking approach facilitates inclusion of arbitrary features over both the input sequence and output stress pattern. Our system advances the current state-of-the-art, predicting primary stress in English, German, and Dutch with up to 98% word accuracy on phonemes, and 96% on letters. The system is also highly accurate in predicting secondary stress. Finally, when applied in tandem with an L2P system, it substantially reduces the word error rate when predicting both phonemes and stress. © 2009 ACL and AFNLP.
Cite
CITATION STYLE
Dou, Q., Bergsma, S., Jiampojamarn, S., & Kondrak, G. (2009). A ranking approach to stress prediction for letter-to-phoneme conversion. In ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf. (pp. 118–126). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1687878.1687897
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.