Normalized log-linear interpolation of backoff language models is efficient

Kenneth Heafield; Chase Geigle; Sean Massung; Lane Schwartz

Conference ProceedingsOPEN ACCESS

Normalized log-linear interpolation of backoff language models is efficient

54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers (2016) 2 876-886

DOI: 10.18653/v1/p16-1083

3Citations

102Readers

Abstract

We prove that log-linearly interpolated backoff language models can be efficiently and exactly collapsed into a single normalized backoff model, contradicting Hsu (2007). While prior work reported that log-linear interpolation yields lower perplexity than linear interpolation, normalizing at query time was impractical. We normalize the model offline in advance, which is efficient due to a recurrence relationship between the normalizing factors. To tune interpolation weights, we apply Newton's method to this convex problem and show that the derivatives can be computed efficiently in a batch process. These findings are combined in new open-source interpolation tool, which is distributed with KenLM. With 21 out-of-domain corpora, log-linear interpolation yields 72.58 perplexity on TED talks, compared to 75.91 for linear interpolation.

Cite

CITATION STYLE

APA

Heafield, K., Geigle, C., Massung, S., & Schwartz, L. (2016). Normalized log-linear interpolation of backoff language models is efficient. In 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers (Vol. 2, pp. 876–886). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p16-1083

Normalized log-linear interpolation of backoff language models is efficient

Abstract

Cite

Register to see more suggestions