Pre-training and fine-tuning have achieved remarkable success in many downstream natural language processing (NLP) tasks. Recently, pre-training methods tailored for information retrieval (IR) have also been explored, and the latest success is the PROP method, which has reached new state-of-the-art (SOTA) results on a variety of ad-hoc retrieval benchmarks. The basic idea of PROP is to construct the representative words prediction (ROP) task for pre-training, inspired by the query likelihood model. Despite its exciting performance, the effectiveness of PROP might be bounded by the classical unigram language model adopted in the ROP task construction process. To tackle this problem, we propose a bootstrapped pre-training method (namely B-PROP) based on BERT for ad-hoc retrieval. The key idea is to use the powerful contextual language model BERT to replace the classical unigram language model for the ROP task construction, and to re-train BERT itself towards the objective tailored for IR. Specifically, we introduce a novel contrastive method, inspired by the divergence-from-randomness idea, to leverage BERT's self-attention mechanism to sample representative words from the document. By further fine-tuning on downstream ad-hoc retrieval tasks, our method achieves significant improvements over PROP and other baselines, and further pushes forward the SOTA on a variety of ad-hoc retrieval tasks.
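To make the contrastive sampling idea concrete, the sketch below is a minimal illustration rather than the authors' implementation: it assumes term weights are taken from the [CLS] self-attention over document tokens (e.g., averaged over heads of BERT's last layer), uses a uniform distribution as the "random" reference, and applies a cross-entropy-style contrast before sampling a representative word set; the paper's exact saturation and normalization details may differ, and the function names here are purely illustrative.

```python
import numpy as np

def contrastive_term_distribution(cls_attention, token_ids):
    """Sketch of a divergence-from-randomness style contrast over document terms.

    cls_attention: 1-D array of [CLS] attention weights over document tokens
                   (e.g., averaged over heads of BERT's last layer).
    token_ids:     the corresponding document token ids.
    """
    # Aggregate attention mass per unique term (repeated tokens pool together).
    weights = {}
    for tid, a in zip(token_ids, cls_attention):
        weights[tid] = weights.get(tid, 0.0) + float(a)
    vocab = list(weights)
    w = np.array([weights[t] for t in vocab])

    # Attention-based document term distribution vs. a uniform "random" reference.
    p_doc = w / w.sum()
    p_rand = np.full_like(p_doc, 1.0 / len(p_doc))

    # Contrast: terms whose attention-based probability exceeds the random
    # expectation are boosted; terms below it are dropped (clipped to zero).
    contrast = np.clip(p_doc * np.log(p_doc / p_rand), a_min=0.0, a_max=None)
    p_contrast = contrast / (contrast.sum() + 1e-12)
    return vocab, p_contrast

def sample_representative_words(vocab, p, k=5, seed=0):
    """Sample a k-term 'representative words' set without replacement."""
    rng = np.random.default_rng(seed)
    k = min(k, int(np.count_nonzero(p)))
    idx = rng.choice(len(vocab), size=k, replace=False, p=p)
    return [vocab[i] for i in idx]
```

In the pre-training setup described above, word sets sampled from such a contrastive distribution would be paired (higher-likelihood vs. lower-likelihood set) to form the ROP objective on which BERT is re-trained; the pairing and loss details follow the paper, not this sketch.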
CITATION STYLE
Ma, X., Guo, J., Zhang, R., Fan, Y., Li, Y., & Cheng, X. (2021). B-PROP: Bootstrapped Pre-training with Representative Words Prediction for Ad-hoc Retrieval. In SIGIR 2021 - Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 1318–1327). Association for Computing Machinery, Inc. https://doi.org/10.1145/3404835.3462869