Abstract
The recent advent of end-to-end generative retrieval marks a significant shift in document retrieval, leveraging differentiable search indexes to directly generate relevant document identifiers (docids) for a given query. Nevertheless, this approach faces two fundamental challenges: (i) a discrepancy between token-level probabilistic optimization and the broader goal of document-level relevance estimation; and (ii) an overemphasis on top-1 results at the expense of overall ranking quality. To tackle these challenges, we propose a generative retrieval model with reinforcement learning from relevance feedback, which aims to align token-level docid generation with document-level relevance estimation. Training proceeds in three stages: supervised fine-tuning, relevance reward model training, and reinforced learning-to-rank from relevance feedback. To train a high-quality reward model, we define "relevance" under three progressive scenarios, which collectively offer a comprehensive evaluation of document relevance. Experiments on two benchmark datasets demonstrate the effectiveness of the proposed approach.
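To make the third stage concrete, below is a minimal, self-contained sketch of reinforced learning-to-rank from a document-level reward signal, written in plain PyTorch. Everything here is an illustrative assumption rather than the authors' implementation: DocidPolicy is a toy GRU generator standing in for a pretrained generative retriever, reward_model is a stub standing in for the trained relevance reward model, the vocabulary and sequence sizes are arbitrary, and a simple REINFORCE update with a running baseline stands in for whatever policy-gradient method the paper actually uses.

# Sketch (not the authors' code): a toy autoregressive policy samples docid
# token sequences for a query; a stub reward model assigns a document-level
# relevance score; REINFORCE pushes token-level generation probabilities
# toward sequences with higher document-level reward.
import torch
import torch.nn as nn

VOCAB, DOCID_LEN, HIDDEN, N_QUERIES = 32, 4, 64, 16  # toy sizes, chosen arbitrarily

class DocidPolicy(nn.Module):
    """Autoregressive generator over docid tokens, conditioned on a query id."""
    def __init__(self):
        super().__init__()
        self.query_emb = nn.Embedding(N_QUERIES, HIDDEN)
        self.tok_emb = nn.Embedding(VOCAB, HIDDEN)
        self.rnn = nn.GRUCell(HIDDEN, HIDDEN)
        self.out = nn.Linear(HIDDEN, VOCAB)

    def sample(self, query_ids):
        """Sample one docid per query; return tokens and their log-probs."""
        h = self.query_emb(query_ids)      # initial hidden state encodes the query
        tok = torch.zeros_like(query_ids)  # token id 0 serves as BOS
        toks, logps = [], []
        for _ in range(DOCID_LEN):
            h = self.rnn(self.tok_emb(tok), h)
            dist = torch.distributions.Categorical(logits=self.out(h))
            tok = dist.sample()
            toks.append(tok)
            logps.append(dist.log_prob(tok))
        return torch.stack(toks, 1), torch.stack(logps, 1)

def reward_model(query_ids, docids):
    """Stub document-level relevance scorer standing in for the trained reward
    model; here it simply rewards docids whose first token matches the query id."""
    return (docids[:, 0] == query_ids % VOCAB).float()

policy = DocidPolicy()
opt = torch.optim.Adam(policy.parameters(), lr=1e-2)
baseline = 0.0  # running reward baseline to reduce gradient variance

for step in range(200):
    q = torch.randint(0, N_QUERIES, (64,))
    docids, logps = policy.sample(q)
    r = reward_model(q, docids)  # document-level relevance feedback
    baseline = 0.9 * baseline + 0.1 * r.mean().item()
    # REINFORCE: scale the summed token log-probs by the centered reward, so
    # sequence-level relevance shapes token-level generation probabilities.
    loss = -((r - baseline) * logps.sum(1)).mean()
    opt.zero_grad()
    loss.backward()
    opt.step()

print("mean reward after training:", reward_model(q, policy.sample(q)[0]).mean().item())

Note how the reward is attached to the whole generated docid rather than to individual tokens: this is the mechanism by which sequence-level (document-level) relevance feedback is folded back into the token-level generation objective, which is the discrepancy the abstract identifies.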
Citation
Zhou, Y., Dou, Z., & Wen, J. R. (2023). Enhancing Generative Retrieval with Reinforcement Learning from Relevance Feedback. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023) (pp. 12481–12490). Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.emnlp-main.768