Normalized log-linear interpolation of backoff language models is efficient

3Citations
Citations of this article
102Readers
Mendeley users who have this article in their library.

Abstract

We prove that log-linearly interpolated backoff language models can be efficiently and exactly collapsed into a single normalized backoff model, contradicting Hsu (2007). While prior work reported that log-linear interpolation yields lower perplexity than linear interpolation, normalizing at query time was impractical. We normalize the model offline in advance, which is efficient due to a recurrence relationship between the normalizing factors. To tune interpolation weights, we apply Newton's method to this convex problem and show that the derivatives can be computed efficiently in a batch process. These findings are combined in new open-source interpolation tool, which is distributed with KenLM. With 21 out-of-domain corpora, log-linear interpolation yields 72.58 perplexity on TED talks, compared to 75.91 for linear interpolation.

Cite

CITATION STYLE

APA

Heafield, K., Geigle, C., Massung, S., & Schwartz, L. (2016). Normalized log-linear interpolation of backoff language models is efficient. In 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers (Vol. 2, pp. 876–886). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p16-1083

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free