Linear Memory Networks

9Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Recurrent neural networks can learn complex transduction problems that require maintaining and actively exploiting a memory of their inputs. Such models traditionally consider memory and input-output functionalities indissolubly entangled. We introduce a novel recurrent architecture based on the conceptual separation between the functional input-output transformation and the memory mechanism, showing how they can be implemented through different neural components. By building on such conceptualization, we introduce the Linear Memory Network, a recurrent model comprising a feedforward neural network, realizing the non-linear functional transformation, and a linear autoencoder for sequences, implementing the memory component. The resulting architecture can be efficiently trained by building on closed-form solutions to linear optimization problems. Further, by exploiting equivalence results between feedforward and recurrent neural networks we devise a pretraining schema for the proposed architecture. Experiments on polyphonic music datasets show that the pretraining schema consistently improves the performance of the LMN and outperforms LSTM architectures by up to 3.7 frame-level accuracy percentage points despite using about one fifth of the number of parameters of an equivalent LSTM.

Cite

CITATION STYLE

APA

Bacciu, D., Carta, A., & Sperduti, A. (2019). Linear Memory Networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11727 LNCS, pp. 513–525). Springer Verlag. https://doi.org/10.1007/978-3-030-30487-4_40

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free