Do People and Neural Nets Pay Attention to the Same Words: Studying Eye-tracking Data for Non-factoid QA Evaluation

Abstract

We investigated how users evaluate passage-length answers to non-factoid questions. We conducted a study in which answers were presented to users, sometimes with automatic word highlighting. Users were tasked with evaluating answer quality, correctness, completeness, and conciseness. Words in the answer were also annotated, both explicitly through user mark-up and implicitly through gaze data obtained from eye-tracking. Our results show that the perceived correctness of an answer depends strongly on its completeness, while conciseness is less important. Analysis of the annotated words showed that correct and incorrect answers were assessed differently. Automatic highlighting helped users evaluate answers more quickly while maintaining accuracy, particularly when the highlighting was similar to the users' own annotations. We fine-tuned a BERT model on a non-factoid QA task to examine whether the model attends to words similar to those annotated. Since such similarity was found, we propose a method that exploits the BERT attention map to generate highlighting suggestions that simulate eye gaze during user evaluation.
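The final step the abstract describes, reading a BERT attention map to decide which answer words to highlight, can be sketched concretely. The snippet below is an illustrative reconstruction, not the authors' released code: the checkpoint name, the attention aggregation (last layer, averaged over heads, taken from the [CLS] row), and the top-k cutoff are all assumptions made for this sketch.

```python
import torch
from transformers import BertModel, BertTokenizerFast

# Assumption: the paper fine-tunes on a non-factoid QA dataset; here we
# load the base checkpoint purely for illustration.
tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertModel.from_pretrained("bert-base-uncased", output_attentions=True)
model.eval()

def attention_highlights(question: str, answer: str, top_k: int = 5):
    """Return the top_k answer tokens that receive the most [CLS] attention.

    The aggregation (last layer, mean over heads, [CLS] row) is one common
    choice, assumed here; it is not necessarily the paper's exact scheme.
    """
    enc = tokenizer(question, answer, return_tensors="pt", truncation=True)
    with torch.no_grad():
        out = model(**enc)
    # out.attentions holds one (batch, heads, seq, seq) tensor per layer.
    cls_attn = out.attentions[-1][0].mean(dim=0)[0]  # attention from [CLS]
    # Score only tokens in the answer segment, excluding special tokens.
    ids = enc["input_ids"][0]
    special = torch.tensor(
        tokenizer.get_special_tokens_mask(
            ids.tolist(), already_has_special_tokens=True),
        dtype=torch.bool)
    answer_mask = (enc["token_type_ids"][0] == 1) & ~special
    scores = torch.where(answer_mask, cls_attn, torch.zeros_like(cls_attn))
    top = torch.topk(scores, k=min(top_k, int(answer_mask.sum())))
    tokens = tokenizer.convert_ids_to_tokens(ids)
    return [(tokens[i], float(scores[i])) for i in top.indices]

print(attention_highlights(
    "Why is the sky blue?",
    "Sunlight is scattered by air molecules, and blue light scatters most."))
```

The returned items are WordPiece tokens; mapping them back to whole words (e.g., merging "##" continuations) would be needed before rendering highlights in a user interface.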

Cite (APA)

Bolotova, V., Blinov, V., Zheng, Y., Croft, W. B., Scholer, F., & Sanderson, M. (2020). Do People and Neural Nets Pay Attention to the Same Words: Studying Eye-tracking Data for Non-factoid QA Evaluation. In International Conference on Information and Knowledge Management, Proceedings (pp. 85–94). Association for Computing Machinery. https://doi.org/10.1145/3340531.3412043
