Restoring Hebrew Diacritics Without a Dictionary

Abstract

We demonstrate that it is feasible to accurately diacritize Hebrew script without any human-curated resources other than plain diacritized text. We present NAKDIMON, a two-layer character-level LSTM that performs on par with much more complicated curation-dependent systems across a diverse array of modern Hebrew sources. The model is accompanied by a training set and a test set, both collected from diverse sources.
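
To illustrate why plain diacritized text can serve as the only resource, the sketch below derives character-level training pairs by separating each base character from the diacritic marks that follow it. This is not NAKDIMON's actual preprocessing. It assumes the marks are Unicode nonspacing marks in the Hebrew block (U+0591 to U+05C7), and the helper names `is_hebrew_mark` and `split_diacritics` are hypothetical.

```python
import unicodedata


def is_hebrew_mark(ch: str) -> bool:
    # Assumption: a Hebrew diacritic is any nonspacing mark (category "Mn")
    # in the range U+0591..U+05C7. Punctuation such as maqaf (U+05BE) is
    # category "Pd" and is therefore treated as a base character.
    return "\u0591" <= ch <= "\u05C7" and unicodedata.category(ch) == "Mn"


def split_diacritics(text: str) -> list[tuple[str, str]]:
    """Return (base_character, attached_marks) pairs for a diacritized string."""
    pairs: list[tuple[str, str]] = []
    # Normalization puts combining marks into canonical order, so the order
    # of several marks on one letter may differ from the typed order.
    for ch in unicodedata.normalize("NFC", text):
        if is_hebrew_mark(ch) and pairs:
            base, marks = pairs[-1]
            pairs[-1] = (base, marks + ch)  # attach the mark to the preceding letter
        else:
            pairs.append((ch, ""))  # start a new base character with no marks yet
    return pairs


# The word "shalom" with diacritics, written with escapes to avoid bidi issues.
word = "\u05E9\u05C1\u05B8\u05DC\u05D5\u05B9\u05DD"
pairs = split_diacritics(word)
bare = "".join(base for base, _ in pairs)    # undiacritized input sequence
labels = [marks for _, marks in pairs]       # per-character labels to predict
```

Under these assumptions, `bare` is what a character-level model would read and `labels` is what it would learn to predict for each character, with no dictionary or morphological analyzer involved.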

Cite

Gershuni, E., & Pinter, Y. (2022). Restoring Hebrew Diacritics Without a Dictionary. In Findings of the Association for Computational Linguistics: NAACL 2022 (pp. 1010–1018). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2022.findings-naacl.75
