The success of deep learning often derives from well-chosen operational building blocks. In this work, we revise the temporal convolution operation in CNNs to better adapt it to text processing. Instead of concatenating word representations, we appeal to tensor algebra and use low-rank n-gram tensors to directly exploit interactions between words already at the convolution stage. Moreover, we extend the n-gram convolution to non-consecutive words to recognize patterns with intervening words. Through a combination of low-rank tensors and pattern weighting, we can efficiently evaluate the resulting convolution operation via dynamic programming. We test the resulting architecture on standard sentiment classification and news categorization tasks. Our model achieves state-of-the-art performance in terms of both accuracy and training speed. For instance, we obtain 51.2% accuracy on the fine-grained sentiment classification task.
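To make the combination of low-rank tensors, skip-gram weighting, and dynamic programming concrete, below is a minimal NumPy sketch of a non-consecutive trigram convolution. It is an illustration under stated assumptions, not the authors' released implementation: the factor matrices P, Q, R (one per n-gram slot), the decay lam penalizing intervening words, and all dimensions are hypothetical names chosen here for exposition.

```python
import numpy as np

def nonconsecutive_trigram_conv(X, P, Q, R, lam):
    """Sketch: non-consecutive 3-gram convolution with a low-rank
    tensor factorization, evaluated in O(seq_len) by dynamic programming.

    X       : (seq_len, d) word embeddings for one sentence
    P, Q, R : (h, d)       factor matrices, one per n-gram slot
    lam     : float in [0, 1], decay weight for each skipped word

    Returns three (seq_len, h) feature maps; f3[t] aggregates all
    (possibly non-consecutive) trigrams ending at position t, each
    weighted by lam ** (number of intervening words skipped).
    """
    seq_len, h = X.shape[0], P.shape[0]
    f1 = np.zeros((seq_len, h))
    f2 = np.zeros((seq_len, h))
    f3 = np.zeros((seq_len, h))
    f1_prev = np.zeros(h)
    f2_prev = np.zeros(h)
    f3_prev = np.zeros(h)
    for t in range(seq_len):
        x = X[t]
        # Each recurrence extends every shorter pattern by the current
        # word; the lam * previous term carries forward patterns that
        # skip x_t, accumulating one decay factor per skipped word.
        f1[t] = lam * f1_prev + P @ x
        f2[t] = lam * f2_prev + f1_prev * (Q @ x)  # elementwise product
        f3[t] = lam * f3_prev + f2_prev * (R @ x)
        f1_prev, f2_prev, f3_prev = f1[t], f2[t], f3[t]
    return f1, f2, f3

# Toy usage with random embeddings; the feature maps would then be
# passed through a non-linearity and pooled for classification.
rng = np.random.default_rng(0)
d, h, n = 50, 20, 8  # embedding dim, feature dim, sentence length
X = rng.normal(size=(n, d))
P, Q, R = (rng.normal(scale=0.1, size=(h, d)) for _ in range(3))
f1, f2, f3 = nonconsecutive_trigram_conv(X, P, Q, R, lam=0.5)
features = np.tanh(f1 + f2 + f3)  # per-position features, shape (n, h)
```

The key point the sketch illustrates is that enumerating all O(n^3) non-consecutive trigrams is never necessary: because the elementwise (low-rank) factorization decomposes each trigram's contribution across its three slots, the weighted sum over all patterns collapses into three linear-time recurrences.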
Lei, T., Barzilay, R., & Jaakkola, T. (2015). Molding CNNs for text: Non-linear, non-consecutive convolutions. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing (EMNLP) (pp. 1565–1575). Association for Computational Linguistics. https://doi.org/10.18653/v1/d15-1180