Abstract
We prove that log-linearly interpolated backoff language models can be efficiently and exactly collapsed into a single normalized backoff model, contradicting Hsu (2007). While prior work reported that log-linear interpolation yields lower perplexity than linear interpolation, normalizing at query time was impractical. We normalize the model offline in advance, which is efficient due to a recurrence relationship between the normalizing factors. To tune interpolation weights, we apply Newton's method to this convex problem and show that the derivatives can be computed efficiently in a batch process. These findings are combined in new open-source interpolation tool, which is distributed with KenLM. With 21 out-of-domain corpora, log-linear interpolation yields 72.58 perplexity on TED talks, compared to 75.91 for linear interpolation.
Cite
CITATION STYLE
Heafield, K., Geigle, C., Massung, S., & Schwartz, L. (2016). Normalized log-linear interpolation of backoff language models is efficient. In 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers (Vol. 2, pp. 876–886). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p16-1083
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.