Multiple-pronunciation dictionaries are often used by automatic speech recognition systems in order to account for different speaking styles. In this paper, two methods based on statistical machine translation (SMT) are used to generate multiple pronunciations from the canonical pronunciation of a word. In the first method, a machine translation tool is used to perform phoneme-to-phoneme (p2p) conversion and derive variants from a given canonical pronunciation. The second method is based on a pivot method proposed for the paraphrase extraction task. The two methods are compared under different training conditions which allow single or multiple pronunciations in the training set, and their performance is evaluated in terms of recall and precision measures. © 2010 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Karanasou, P., & Lamel, L. (2010). Comparing SMT methods for automatic generation of pronunciation variants. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6233 LNAI, pp. 167–178). https://doi.org/10.1007/978-3-642-14770-8_20
Mendeley helps you to discover research relevant for your work.