Experiments with Language Models for Word Completion and Prediction in Hebrew

Abstract

In this paper, we describe various language models (LMs) and their combinations, created to support word prediction and completion in Hebrew. We define and apply five general types of LMs: (1) Basic LMs (unigrams, bigrams, trigrams, and quadgrams), (2) Backoff LMs, (3) LMs Integrated with tagged LMs, (4) Interpolated LMs, and (5) Interpolated LMs Integrated with tagged LMs. Sixteen specific implementations of these LMs were compared using three types of Israeli web newspaper corpora. The best keystroke-saving results were achieved with the most complex variety, the Interpolated LMs Integrated with tagged LMs. We therefore conclude that synthesizing the strengths of all four basic LMs and the tagged LMs leads to the best results.

CITATION STYLE

APA

HaCohen-Kerner, Y., Applebaum, A., & Bitterman, J. (2014). Experiments with Language Models for Word Completion and Prediction in Hebrew. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8686, 450–462. https://doi.org/10.1007/978-3-319-10888-9_44
