TimelineQA: A Benchmark for Question Answering over Timelines

3Citations
Citations of this article
18Readers
Mendeley users who have this article in their library.

Abstract

Lifelogs are descriptions of experiences that a person had during their life. Lifelogs are created by fusing data from the multitude of digital services, such as online photos, maps, shopping and content streaming services. Question answering over lifelogs can offer personal assistants a critical resource when they try to provide advice in context. However, obtaining answers to questions over lifelogs is beyond the current state of the art of question answering techniques for a variety of reasons, the most pronounced of which is that lifelogs combine free text with some degree of structure such as temporal and geographical information. We create and publicly release TimelineQA, a benchmark for accelerating progress on querying lifelogs. TimelineQA generates lifelogs of imaginary people. The episodes in the lifelog range from major life episodes such as high school graduation to those that occur on a daily basis such as going for a run. We describe a set of experiments on TimelineQA with several state-of-the-art QA models. Our experiments reveal that for atomic queries, an extractive QA system significantly out-performs a state-of-the-art retrieval-augmented QA system. For multi-hop queries involving aggregates, we show that the best result is obtained with a state-of-the-art table QA technique, assuming the ground truth set of episodes for deriving the answer is available.

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Tan, W. C., Dwivedi-Yu, J., Li, Y., Mathias, L., Saeidi, M., Yan, J. N., & Halevy, A. Y. (2023). TimelineQA: A Benchmark for Question Answering over Timelines. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 77–91). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-acl.6

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 5

63%

Professor / Associate Prof. 1

13%

Lecturer / Post doc 1

13%

Researcher 1

13%

Readers' Discipline

Tooltip

Computer Science 9

90%

Medicine and Dentistry 1

10%

Save time finding and organizing research with Mendeley

Sign up for free