Learning continuousword embedding with metadata for question retrieval in community question answering

N/ACitations
Citations of this article
199Readers
Mendeley users who have this article in their library.

Abstract

Community question answering (cQA) has become an important issue due to the popularity of cQA archives on the web. This paper is concerned with the problem of question retrieval. Question retrieval in cQA archives aims to find the existing questions that are semantically equivalent or relevant to the queried questions. However, the lexical gap problem brings about new challenge for question retrieval in cQA. In this paper, we propose to learn continuous word embeddings with metadata of category information within cQA pages for question retrieval. To deal with the variable size of word embedding vectors, we employ the framework of fisher kernel to aggregated them into the fixedlength vectors. Experimental results on large-scale real world cQA data set show that our approach can significantly outperform state-of-The-Art translation models and topic-based models for question retrieval in cQA.

Cite

CITATION STYLE

APA

Zhou, G., He, T., Zhao, J., & Hu, P. (2015). Learning continuousword embedding with metadata for question retrieval in community question answering. In ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference (Vol. 1, pp. 250–259). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/p15-1025

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free