A strong baseline for question relevancy ranking

Ana V. González-Garduño; Isabelle Augenstein; Anders Søgaard

Conference ProceedingsOPEN ACCESS

A strong baseline for question relevancy ranking

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018 (2018) 4810-4815

DOI: 10.18653/v1/d18-1515

3Citations

103Readers

Abstract

The best systems at the SemEval-16 and SemEval-17 community question answering shared tasks - a task that amounts to question relevancy ranking - involve complex pipelines and manual feature engineering. Despite this, many of these still fail at beating the IR baseline, i.e., the rankings provided by Google's search engine. We present a strong baseline for question relevancy ranking by training a simple multi-task feed forward network on a bag of 14 distance measures for the input question pair. This baseline model, which is fast to train and uses only language-independent features, outperforms the best shared task systems on the task of retrieving relevant previously asked questions.

Cite

CITATION STYLE

APA

González-Garduño, A. V., Augenstein, I., & Søgaard, A. (2018). A strong baseline for question relevancy ranking. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018 (pp. 4810–4815). Association for Computational Linguistics. https://doi.org/10.18653/v1/d18-1515

A strong baseline for question relevancy ranking

Abstract

Cite

Register to see more suggestions