Sliced recurrent neural networks

ArXiv: 1808.06170

Abstract

Recurrent neural networks (RNNs) have achieved great success in many NLP tasks. However, their recurrent structure makes them difficult to parallelize, so training RNNs takes a long time. In this paper, we introduce sliced recurrent neural networks (SRNNs), which can be parallelized by slicing the sequences into many subsequences. SRNNs can obtain high-level information through multiple layers with few extra parameters. We prove that the standard RNN is a special case of the SRNN when we use linear activation functions. Without changing the recurrent units, SRNNs are 136 times as fast as standard RNNs and could be even faster when we train on longer sequences. Experiments on six large-scale sentiment analysis datasets show that SRNNs achieve better performance than standard RNNs.
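
The slicing idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the plain tanh recurrent unit, the two-layer depth, the function names (rnn_last_state, sliced_rnn), and all sizes are assumptions chosen for illustration. The sequence is cut into equal subsequences, one recurrent pass runs over all subsequences at once as a batch, and a second recurrent pass runs over the resulting per-subsequence summaries.

```python
import numpy as np

def rnn_last_state(x, W, U, b):
    """Run a plain tanh RNN over a batch and return the final hidden state.

    x has shape (batch, steps, features). The tanh unit is an assumption;
    the abstract says the recurrent unit itself is left unchanged.
    """
    h = np.zeros((x.shape[0], U.shape[0]))
    for t in range(x.shape[1]):
        h = np.tanh(x[:, t, :] @ W + h @ U + b)
    return h

def sliced_rnn(x, n_slices, params):
    """Hypothetical two-layer sliced RNN over one sequence x of shape (T, d)."""
    T, d = x.shape
    assert T % n_slices == 0, "this sketch needs T divisible by n_slices"
    # Cut the sequence into n_slices subsequences of length T / n_slices.
    subseqs = x.reshape(n_slices, T // n_slices, d)
    # Layer 0: subsequences are independent, so they run together as a batch.
    summaries = rnn_last_state(subseqs, *params[0])      # (n_slices, hidden)
    # Layer 1: a second RNN reads the subsequence summaries in order.
    top = rnn_last_state(summaries[None, :, :], *params[1])  # (1, hidden)
    return top[0]

rng = np.random.default_rng(0)
d, hidden = 4, 8

def init(in_dim):
    # Small random weights; the values carry no meaning beyond the demo.
    return (0.1 * rng.normal(size=(in_dim, hidden)),
            0.1 * rng.normal(size=(hidden, hidden)),
            np.zeros(hidden))

params = [init(d), init(hidden)]
x = rng.normal(size=(64, d))           # one sequence of 64 steps
out = sliced_rnn(x, n_slices=8, params=params)
print(out.shape)
```

In this sketch a 64-step sequence needs 8 sequential steps in the first layer plus 8 in the second, instead of 64 in a single recurrent pass. That shorter chain of dependent steps is what makes the sliced layout amenable to parallel hardware; the sketch says nothing about the speedups reported in the paper.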

Citation (APA)

Yu, Z., & Liu, G. (2018). Sliced recurrent neural networks. In COLING 2018 - 27th International Conference on Computational Linguistics, Proceedings (pp. 2953–2964). Association for Computational Linguistics (ACL).
