A discrete probabilistic memory model for discovering dependencies in time

Abstract

Many domains of machine learning involve discovering dependencies and structure over time. In the most complex domains, long-term temporal dependencies are present. Neural network models such as LSTM have been developed to deal with long-term dependencies, but the continuous nature of neural networks is not well suited to discrete symbol processing tasks. Further, the mathematical underpinnings of neural networks are unclear, and gradient descent learning of recurrent neural networks seems particularly susceptible to local optima. We introduce a novel architecture for discovering dependencies in time. The architecture is formed by combining two variants of a hidden Markov model (HMM), the factorial HMM and the input-output HMM, and adding a further strong constraint that requires the model to behave as a latch-and-store memory (the same constraint exploited in LSTM). This model, called an MIOFHMM, can learn structure that other variants of the HMM cannot, and can generalize better than LSTM on test sequences that have different statistical properties (different lengths, different types of noise) than the training sequences. However, the MIOFHMM is slower to train and is more susceptible to local optima than LSTM.
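To make the latch-and-store constraint concrete, the sketch below shows a toy, input-conditioned HMM memory element, not the paper's implementation: on a neutral input the transition matrix is the identity, so the hidden state is latched indefinitely, while a "store" input drives the state to encode the stored symbol. The two-state setup and the symbol names (`neutral`, `store_A`, `store_B`) are illustrative assumptions, not notation from the paper.

```python
# Illustrative sketch of a latch-and-store memory element as an
# input-conditioned HMM (assumed toy example, not the MIOFHMM itself).
import numpy as np

STATES = ["holds_A", "holds_B"]            # hidden memory states
INPUTS = ["neutral", "store_A", "store_B"]

# Input-conditioned transitions: T[u][i, j] = P(next=j | current=i, input=u)
T = {
    "neutral": np.eye(2),                            # latch: state never changes
    "store_A": np.array([[1.0, 0.0], [1.0, 0.0]]),   # store: jump to holds_A
    "store_B": np.array([[0.0, 1.0], [0.0, 1.0]]),   # store: jump to holds_B
}

def propagate(belief, input_sequence):
    """Propagate a belief state (distribution over STATES) through the inputs."""
    for u in input_sequence:
        belief = belief @ T[u]
    return belief

if __name__ == "__main__":
    belief = np.array([0.5, 0.5])            # uncertain initial memory content
    seq = ["store_A"] + ["neutral"] * 100    # store once, then a long gap
    print(propagate(belief, seq))            # -> [1. 0.]: the stored symbol survives
```

Because the neutral-input transition matrix is the identity, the belief over memory states is preserved exactly across arbitrarily long gaps, which is the discrete analogue of the constant-error-carousel behaviour that LSTM relies on for long-term dependencies.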

Citation (APA)

Hochreiter, S., & Mozer, M. C. (2001). A discrete probabilistic memory model for discovering dependencies in time. In Lecture Notes in Computer Science (Vol. 2130, pp. 661–668). Springer-Verlag. https://doi.org/10.1007/3-540-44668-0_92
