Impact of Data Quality on Question Answering System Performances

4Citations
Citations of this article
37Readers
Mendeley users who have this article in their library.

Abstract

In contrast with the research of new models, little attention has been paid to the impact of low or high-quality data feeding a dialogue system. The present paper makes the first attempt to fill this gap by extending our previous work on question-answering (QA) systems by investigating the effect of misspelling on QA agents and how context changes can enhance the responses. Instead of using large language models trained on huge datasets, we propose a method that enhances the model's score by modifying only the quality and structure of the data feed to the model. It is important to identify the features that modify the agent performance because a high rate of wrong answers can make the students lose their interest in using the QA agent as an additional tool for distant learning. The results demonstrate the accuracy of the proposed context simplification exceeds 85%. These findings shed light on the importance of question data quality and context complexity construct as key dimensions of the QA system. In conclusion, the experimental results on questions and contexts showed that controlling and improving the various aspects of data quality around the QA system can significantly enhance his robustness and performance.

References Powered by Scopus

Term-weighting approaches in automatic text retrieval

6799Citations
N/AReaders
Get full text

SQuad: 100,000+ questions for machine comprehension of text

3976Citations
N/AReaders
Get full text

Beyond accuracy: What data quality means to data consumers

3247Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Designing a Chatbot for Contemporary Education: A Systematic Literature Review

8Citations
N/AReaders
Get full text

Analysis of QA System Behavior against Context and Question Changes

3Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Karra, R., & Lasfar, A. (2023). Impact of Data Quality on Question Answering System Performances. Intelligent Automation and Soft Computing, 35(1), 335–349. https://doi.org/10.32604/iasc.2023.026695

Readers' Seniority

Tooltip

Professor / Associate Prof. 13

68%

PhD / Post grad / Masters / Doc 5

26%

Researcher 1

5%

Readers' Discipline

Tooltip

Computer Science 4

44%

Business, Management and Accounting 3

33%

Social Sciences 1

11%

Engineering 1

11%

Save time finding and organizing research with Mendeley

Sign up for free