We describe the setting and results of the ConvAI2 NeurIPS competition that aims to further the state-of-the-art in open-domain chatbots. Some key takeaways from the competition are: (i) pretrained Transformer variants are currently the best performing models on this task, (ii) but to improve performance on multi-turn conversations with humans, future systems must go beyond single word metrics like perplexity to measure the performance across sequences of utterances (conversations) -- in terms of repetition, consistency and balance of dialogue acts (e.g. how many questions asked vs. answered).
CITATION STYLE
Dinan, E., Logacheva, V., Malykh, V., Miller, A., Shuster, K., Urbanek, J., … Weston, J. (2020). The Second Conversational Intelligence Challenge (ConvAI2) (pp. 187–208). https://doi.org/10.1007/978-3-030-29135-8_7
Mendeley helps you to discover research relevant for your work.