Abstract
Pre-trained language models have been successful in many knowledge-intensive NLP tasks. However, recent work has shown that models such as BERT are not "structurally ready" to aggregate textual information into a [cls] vector for dense passage retrieval (DPR). This "lack of readiness" results from the gap between language model pre-training and DPR fine-tuning. Previous solutions call for computationally expensive techniques such as hard negative mining, cross-encoder distillation, and further pre-training to learn a robust DPR model. In this work, we instead propose to fully exploit the knowledge in a pre-trained language model for DPR by aggregating the contextualized token embeddings into a dense vector, which we call agg*. By concatenating vectors from the [cls] token and agg*, our Aggretriever model substantially improves the effectiveness of dense retrieval models on both in-domain and zero-shot evaluations without introducing substantial training overhead. Code is available at https://github.com/castorini/dhr.
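To make the "[cls] plus aggregated-token" idea concrete, below is a minimal PyTorch sketch of encoding a query and a passage into concatenated vectors and scoring them by inner product. This is an assumption-laden illustration, not the paper's method: simple max pooling over token embeddings stands in for the actual agg* aggregation (see the linked repository for the real construction), and bert-base-uncased is only a placeholder encoder.

```python
# Hedged sketch: max pooling below is a stand-in for agg*, NOT the paper's
# exact aggregation; it only illustrates concatenating [cls] with a vector
# pooled from the contextualized token embeddings.
import torch
from transformers import AutoTokenizer, AutoModel

model_name = "bert-base-uncased"  # assumption: any BERT-style encoder
tokenizer = AutoTokenizer.from_pretrained(model_name)
encoder = AutoModel.from_pretrained(model_name)
encoder.eval()

def encode(text: str) -> torch.Tensor:
    """Return one dense vector: [cls] concatenated with a pooled
    token representation (simplified stand-in for agg*)."""
    inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=256)
    with torch.no_grad():
        hidden = encoder(**inputs).last_hidden_state   # (1, seq_len, 768)
    cls_vec = hidden[:, 0, :]                           # [cls] embedding
    # mask out padding positions before pooling over tokens
    mask = inputs["attention_mask"].unsqueeze(-1).bool()
    token_states = hidden.masked_fill(~mask, float("-inf"))
    agg_vec = token_states[:, 1:, :].max(dim=1).values  # toy aggregation
    return torch.cat([cls_vec, agg_vec], dim=-1)        # (1, 1536)

query = encode("what is dense passage retrieval?")
passage = encode("Dense passage retrieval encodes queries and passages "
                 "into vectors and ranks passages by inner product.")
score = (query * passage).sum()  # dot-product relevance score
print(score.item())
```

In the trained Aggretriever, both halves of the concatenated vector are learned during DPR fine-tuning; the point of the sketch is only the shape of the representation, not its quality before training.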
Citation
Lin, S. C., Li, M., & Lin, J. (2023). Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval. Transactions of the Association for Computational Linguistics, 11, 436–452. https://doi.org/10.1162/tacl_a_00556