The Custom Decay Language Model for long range dependencies

Abstract

Significant correlations between words can be observed over long distances, but contemporary language models such as N-grams, skip-grams, and recurrent neural network language models (RNNLMs) require a large number of parameters to capture these dependencies, if they can capture them at all. In this paper, we propose the Custom Decay Language Model (CDLM), which captures long-range correlations while keeping the growth in parameters sub-linear in the vocabulary size. The model has a robust and stable training procedure (unlike RNNLMs), a more powerful modeling scheme than the Skip models, and a customizable representation. In perplexity experiments, CDLMs outperform the Skip models while using fewer parameters. A CDLM also nominally outperformed a similar-sized RNNLM, indicating that it learned as much as the RNNLM but without recurrence.
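For intuition, the sketch below shows one way a distance-dependent decay can weight contributions from far-away context words, in the spirit of skip-bigram interpolation. The exponential decay form exp(-lam * d), the back-off weight, and the helper functions train_skip_counts and decayed_prob are illustrative assumptions only, not the authors' exact CDLM formulation.

# Illustrative sketch: distance-decayed skip-bigram interpolation (assumed form,
# not the published CDLM equations).
import math
from collections import Counter, defaultdict

def train_skip_counts(corpus, max_dist=10):
    """Collect (context word, distance) -> target-word counts and unigram counts."""
    skip_counts = defaultdict(Counter)
    unigrams = Counter()
    for sentence in corpus:
        for i, target in enumerate(sentence):
            unigrams[target] += 1
            for d in range(1, max_dist + 1):
                if i - d < 0:
                    break
                skip_counts[(sentence[i - d], d)][target] += 1
    return skip_counts, unigrams

def decayed_prob(target, history, skip_counts, unigrams, lam=0.5, backoff=1.0, max_dist=10):
    """P(target | history) as a convex mix of distance-d skip-bigram estimates,
    each weighted by exp(-lam * d), plus a unigram back-off component."""
    total_uni = sum(unigrams.values())
    components = [(backoff, unigrams[target] / total_uni)]  # back-off term
    for d in range(1, min(max_dist, len(history)) + 1):
        ctx_counts = skip_counts.get((history[-d], d))
        if ctx_counts:
            w = math.exp(-lam * d)  # contribution decays with distance
            components.append((w, ctx_counts[target] / sum(ctx_counts.values())))
    z = sum(w for w, _ in components)
    return sum(w * p for w, p in components) / z  # normalize weights to sum to 1

# Example usage on a toy corpus:
corpus = [["the", "cat", "sat", "on", "the", "mat"],
          ["the", "dog", "sat", "on", "the", "rug"]]
skip_counts, unigrams = train_skip_counts(corpus)
print(decayed_prob("mat", ["the", "cat", "sat", "on", "the"], skip_counts, unigrams))

Because each distance bucket reuses the same decay weight rather than a separate parameter set per position, the number of parameters in such a scheme grows far more slowly than in a full high-order N-gram model, which is the kind of trade-off the abstract describes.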

Citation (APA)

Singh, M., Greenberg, C., & Klakow, D. (2016). The Custom Decay Language Model for long range dependencies. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9924 LNCS, pp. 343–351). Springer Verlag. https://doi.org/10.1007/978-3-319-45510-5_39
