This paper describes how we tackled the development of Amaia, a conversational agent for Portuguese entrepreneurs. After introducing the domain corpus used as Amaia’s Knowledge Base (KB), we make an extensive comparison of approaches for automatically matching user requests with Frequently Asked Questions (FAQs) in the KB, covering Information Retrieval (IR), approaches based on static and contextual word embeddings, and a model of Semantic Textual Similarity (STS) trained for Portuguese, which achieved the best performance. We further describe how we decreased the model’s complexity and improved scalability, with minimal impact on performance. In the end, Amaia combines an IR library and an STS model with reduced features. Towards a more human-like behavior, Amaia can also answer out-of-domain questions, based on a second corpus integrated in the KB. Such interactions are identified with a text classifier, also described in the paper.
CITATION STYLE
Santos, J., Duarte, L., Ferreira, J., Alves, A., & Oliveira, H. G. (2020). Developing amaia: A conversational agent for helping portuguese entrepreneurs—an extensive exploration of question-matching approaches for portuguese. Information (Switzerland), 11(9), 1–21. https://doi.org/10.3390/info11090428
Mendeley helps you to discover research relevant for your work.