Supervised grapheme-to-phoneme conversion of orthographic schwas in Hindi and Punjabi


Abstract

Hindi grapheme-to-phoneme (G2P) conversion is mostly trivial, with one exception: deciding whether a schwa represented in the orthography is pronounced or unpronounced (deleted). Previous work has attempted to predict schwa deletion in a rule-based fashion using prosodic or phonetic analysis. We present the first statistical schwa deletion classifier for Hindi, which relies solely on the orthography as input and outperforms previous approaches. We trained our model on a newly compiled pronunciation lexicon extracted from various online dictionaries. Our best Hindi model achieves state-of-the-art performance and also performs well, without modification, on a closely related language, Punjabi.
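
To make "a schwa represented in the orthography" concrete: in Devanagari, a consonant letter carries an inherent schwa unless it is followed by a vowel sign or a virama. The sketch below only locates those candidate positions in a Hindi word. It is not the classifier described in the abstract, whose features and model are not given here. The function name `orthographic_schwa_positions` and the character sets are assumptions made for illustration, and Gurmukhi (Punjabi) is not handled.

```python
# Illustrative sketch only: find consonants that carry an inherent
# (orthographic) schwa in a Devanagari word. Deciding whether each schwa is
# pronounced or deleted is the classifier's job and is NOT attempted here.

VIRAMA = "\u094d"  # suppresses the inherent vowel
NUKTA = "\u093c"   # diacritic that may follow a consonant

# Basic consonants (U+0915..U+0939) plus precomposed nukta forms (U+0958..U+095F).
CONSONANTS = {chr(c) for c in range(0x0915, 0x093A)} | {
    chr(c) for c in range(0x0958, 0x0960)
}

# Common dependent vowel signs (U+093E..U+094C); they replace the schwa.
VOWEL_SIGNS = {chr(c) for c in range(0x093E, 0x094D)}


def orthographic_schwa_positions(word):
    """Return indices of consonants not followed by a vowel sign or virama."""
    positions = []
    for i, ch in enumerate(word):
        if ch not in CONSONANTS:
            continue
        j = i + 1
        # Skip a nukta attached to this consonant, if present.
        if j < len(word) and word[j] == NUKTA:
            j += 1
        following = word[j] if j < len(word) else ""
        if following != VIRAMA and following not in VOWEL_SIGNS:
            positions.append(i)
    return positions
```

For the word नमकीन (code points U+0928 U+092E U+0915 U+0940 U+0928), the function returns `[0, 1, 4]`: the first, second, and last consonants each carry an orthographic schwa, while the third is followed by a vowel sign. A schwa deletion model would then label each of these positions as pronounced or deleted.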

Arora, A., Gessler, L., & Schneider, N. (2020). Supervised grapheme-to-phoneme conversion of orthographic schwas in Hindi and Punjabi. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 7791–7795). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2020.acl-main.696
