A Hybrid Text Generation-Based Query Expansion Method for Open-Domain Question Answering

Abstract

In two-stage open-domain question answering (OpenQA) systems, the retriever identifies a subset of relevant passages, which the reader then uses to extract or generate answers. However, OpenQA performance is often hindered by short, semantically ambiguous queries, which make it hard for the retriever to find relevant passages quickly. This paper introduces Hybrid Text Generation-Based Query Expansion (HTGQE), a method for improving retrieval efficiency. HTGQE combines large language models with pseudo-relevance feedback (PRF) techniques to enhance the input to the generative models, improving the speed and quality of text generation. Building on this foundation, HTGQE employs multiple query expansion generators, each trained to provide query expansion contexts from a distinct perspective. This lets the retriever explore relevant passages from various angles and obtain complementary retrieval results. Under both extractive and generative QA setups, HTGQE achieves promising results on the Natural Questions (NQ) and TriviaQA (Trivia) datasets for passage retrieval and reading tasks.
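The pipeline the abstract describes (feedback passages fed to several expansion generators, followed by merging of the per-generator retrieval results) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration and is not taken from the paper: the toy corpus, the term-overlap `retrieve` function standing in for a real retriever, the two rule-based functions standing in for trained generators, and the use of reciprocal rank fusion to merge the rankings.

```python
from collections import Counter, defaultdict
from typing import Callable, Dict, List

# Toy passage collection (illustrative data only, not from the paper).
CORPUS: Dict[str, str] = {
    "p1": "the eiffel tower is a landmark in paris france",
    "p2": "gustave eiffel was the engineer whose company built the tower",
    "p3": "paris is the capital city of france",
    "p4": "the tower of london is a castle in england",
}


def tokenize(text: str) -> List[str]:
    return text.lower().split()


def retrieve(query: str, k: int = 3) -> List[str]:
    """Rank passages by term overlap with the query.

    Assumption: a simple stand-in for a real sparse or dense retriever.
    """
    q = Counter(tokenize(query))
    scores = {}
    for pid, text in CORPUS.items():
        p = Counter(tokenize(text))
        scores[pid] = sum(min(q[t], p[t]) for t in q)
    ranked = sorted(scores, key=lambda pid: (-scores[pid], pid))
    return [pid for pid in ranked if scores[pid] > 0][:k]


# A generator maps (query, feedback passages) to an expansion string.
Generator = Callable[[str, List[str]], str]


def frequent_terms_generator(query: str, feedback: List[str]) -> str:
    """Hypothetical stand-in for a trained generator: frequent new terms."""
    seen = set(tokenize(query))
    counts = Counter(t for text in feedback for t in tokenize(text) if t not in seen)
    return " ".join(term for term, _ in counts.most_common(3))


def lead_passage_generator(query: str, feedback: List[str]) -> str:
    """Hypothetical stand-in for a second generator: opening of the top passage."""
    return " ".join(tokenize(feedback[0])[:4]) if feedback else ""


def expand_and_retrieve(
    query: str, generators: List[Generator], k: int = 3, feedback_k: int = 2
) -> List[str]:
    # Pseudo-relevance feedback: treat the top initial results as relevant context.
    feedback = [CORPUS[pid] for pid in retrieve(query, k=feedback_k)]
    fused: Dict[str, float] = defaultdict(float)
    for generate in generators:
        expanded = f"{query} {generate(query, feedback)}".strip()
        # Reciprocal rank fusion (an assumed merging rule, constant 60).
        for rank, pid in enumerate(retrieve(expanded, k=k), start=1):
            fused[pid] += 1.0 / (60 + rank)
    ranked = sorted(fused, key=lambda pid: (-fused[pid], pid))
    return ranked[:k]


if __name__ == "__main__":
    generators = [frequent_terms_generator, lead_passage_generator]
    print(expand_and_retrieve("who built the eiffel tower", generators))
```

In a real system, each stand-in generator would be replaced by a trained language model conditioned on the query and the feedback passages, and the fusion rule would follow whatever merging strategy the paper actually specifies.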

Cite

Zhu, W., Zhang, X., Zhai, Q., & Liu, C. (2023). A Hybrid Text Generation-Based Query Expansion Method for Open-Domain Question Answering. Future Internet, 15(5). https://doi.org/10.3390/fi15050180
