Learning hybrid representations to retrieve semantically equivalent questions

Cícero Dos Santos; Luciano Barbosa; Dasha Bogdanova; Bianca Zadrozny

Conference ProceedingsOPEN ACCESS

Learning hybrid representations to retrieve semantically equivalent questions

ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference (2015) 2 694-699

DOI: 10.3115/v1/p15-2114

80Citations

125Readers

Abstract

Retrieving similar questions in online Q&A community sites is a difficult task because different users may formulate the same question in a variety of ways, using different vocabulary and structure. In this work, we propose a new neural network architecture to perform the task of semantically equivalent question retrieval. The proposed architecture, which we call BOW-CNN, combines a bag-ofwords (BOW) representation with a distributed vector representation created by a convolutional neural network (CNN). We perform experiments using data collected from two Stack Exchange communities. Our experimental results evidence that: (1) BOW-CNN is more effective than BOW based information retrieval methods such as TFIDF; (2) BOW-CNN is more robust than the pure CNN for long texts.

Cite

CITATION STYLE

APA

Dos Santos, C., Barbosa, L., Bogdanova, D., & Zadrozny, B. (2015). Learning hybrid representations to retrieve semantically equivalent questions. In ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference (Vol. 2, pp. 694–699). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/p15-2114

Learning hybrid representations to retrieve semantically equivalent questions

Abstract

Cite

Register to see more suggestions