Embedding lexical features via low-rank tensors

Mo Yu; Mark Dredze; Raman Arora; Matthew R. Gormley

Conference ProceedingsOPEN ACCESS

Embedding lexical features via low-rank tensors

2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference (2016) 1019-1029

DOI: 10.18653/v1/n16-1117

5Citations

102Readers

Abstract

Modern NLP models rely heavily on engineered features, which often combine word and contextual information into complex lexical features. Such combination results in large numbers of features, which can lead to overfitting. We present a new model that represents complex lexical features - comprised of parts for words, contextual information and labels - in a tensor that captures conjunction information among these parts. We apply low-rank tensor approximations to the corresponding parameter tensors to reduce the parameter space and improve prediction speed. Furthermore, we investigate two methods for handling features that include n-grams of mixed lengths. Our model achieves state-of-the-art results on tasks in relation extraction, PP-attachment, and preposition disambiguation.

Cite

CITATION STYLE

APA

Yu, M., Dredze, M., Arora, R., & Gormley, M. R. (2016). Embedding lexical features via low-rank tensors. In 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference (pp. 1019–1029). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/n16-1117

Embedding lexical features via low-rank tensors

Abstract

Cite

Register to see more suggestions