Entity-Conditioned Question Generation for Robust Attention Distribution in Neural Information Retrieval

Abstract

We show that supervised neural information retrieval (IR) models are prone to learning sparse attention patterns over passage tokens, which can result in key phrases, including named entities, receiving low attention weights and ultimately hurting retrieval performance. Using a novel targeted synthetic data generation method that identifies poorly attended entities and conditions question generation on those entities, we teach neural IR models to attend more uniformly and robustly to all entities in a given passage. On two public IR benchmarks, we empirically show that the proposed method improves both the model's attention patterns and its retrieval performance, including in zero-shot settings.
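The core idea sketched in the abstract is a two-step loop: first identify entities in a passage that the retriever attends to weakly, then generate synthetic questions conditioned on exactly those entities and add them to the training data. The snippet below is a minimal, hypothetical illustration of the entity-selection step only; the function name, inputs, and threshold are assumptions for illustration and do not reflect the authors' actual implementation. It assumes per-token attention weights (e.g., from the retriever's passage encoder) and named-entity spans (e.g., from an NER tagger) are already available.

```python
# Hypothetical sketch: flag named entities whose attention mass is low
# relative to the passage-wide average, so they can seed question generation.
# Names and threshold are illustrative assumptions, not the paper's code.

from typing import List, Tuple


def poorly_attended_entities(
    attention: List[float],                     # one attention weight per passage token
    entity_spans: List[Tuple[int, int, str]],   # (start, end, surface form) token spans
    threshold: float = 0.5,
) -> List[str]:
    """Return entity surface forms whose mean attention falls below
    `threshold` times the mean attention over the whole passage."""
    if not attention:
        return []
    passage_mean = sum(attention) / len(attention)
    flagged = []
    for start, end, surface in entity_spans:
        span_weights = attention[start:end]
        if not span_weights:
            continue
        span_mean = sum(span_weights) / len(span_weights)
        if span_mean < threshold * passage_mean:
            flagged.append(surface)
    return flagged


# Example: the entity span "Marie Curie" receives little attention relative
# to the rest of the passage, so it would be selected to seed a question.
attn = [0.02, 0.01, 0.10, 0.12, 0.09, 0.11, 0.08]
spans = [(0, 2, "Marie Curie")]
print(poorly_attended_entities(attn, spans))  # ['Marie Curie']
```

In the method described by the abstract, the flagged entities would then condition a question generator, and the resulting synthetic question-passage pairs would augment the retriever's training data to encourage more uniform attention over entities.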

Citation (APA)

Reddy, R. G., Sultan, M. A., Franz, M., Sil, A., & Ji, H. (2022). Entity-Conditioned Question Generation for Robust Attention Distribution in Neural Information Retrieval. In SIGIR 2022 - Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 2462–2466). Association for Computing Machinery, Inc. https://doi.org/10.1145/3477495.3531878
