Dense Retrieval with Entity Views

5Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Pre-trained language models like BERT have been demonstrated to be both effective and efficient ranking methods when combined with approximate nearest neighbor search, which can quickly match dense representations of queries and documents. However, pretrained language models alone do not fully capture information about uncommon entities. In this work, we investigate methods for enriching dense query and document representations with entity information from an external source. Our proposed method identifies groups of entities in a text and encodes them into a dense vector representation, which is then used to enrich BERT's vector representation of the text. To handle documents that contain many loosely-related entities, we devise a strategy for creating multiple entity representations that reflect different views of a document. For example, a document about a scientist may cover aspects of her personal life and recent work, which correspond to different views of the entity. In an evaluation on MS MARCO benchmarks, we find that enriching query and document representations in this way yields substantial increases in effectiveness.

Cite

CITATION STYLE

APA

Tran, H. D., & Yates, A. (2022). Dense Retrieval with Entity Views. In International Conference on Information and Knowledge Management, Proceedings (pp. 1955–1964). Association for Computing Machinery. https://doi.org/10.1145/3511808.3557285

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free