Building chatbots from forum data: Model selection using question answering metrics

Martin Boyanov; Ivan Koychev; Preslav Nakov; Alessandro Moschitti; Giovanni Da San Martino

Conference ProceedingsOPEN ACCESS

Building chatbots from forum data: Model selection using question answering metrics

International Conference Recent Advances in Natural Language Processing, RANLP (2017) 2017-September 121-129

DOI: 10.26615/978-954-452-049-6_018

4Citations

122Readers

Abstract

We propose to use question answering (QA) data from Web forums to train chatbots from scratch, i.e., without dialog training data. First, we exrtact pairs of question and answer sentences from the typically much longer texts of questions and answers in a forum. We then use these shorter texts to train seq2seq models in a more efficient way. We further improve the parameter optimization using a new model selection strategy based on QA measures. Finally, we propose to use extrinsic evaluation with respect to a QA task as an automatic evaluation method for chatbots. Fhe evaluation shows that the model achieves a MAP of 63.5% on the extrinsic task. Moreover, it can answer correctly 49.5% of the questions when they are similar to questions asked in the forum, and 47.3% of the questions when they are more conversational in style.

Cite

CITATION STYLE

APA

Boyanov, M., Koychev, I., Nakov, P., Moschitti, A., & Da San Martino, G. (2017). Building chatbots from forum data: Model selection using question answering metrics. In International Conference Recent Advances in Natural Language Processing, RANLP (Vol. 2017-September, pp. 121–129). Incoma Ltd. https://doi.org/10.26615/978-954-452-049-6_018

Building chatbots from forum data: Model selection using question answering metrics

Abstract

Cite

Register to see more suggestions