Learning to Re-Rank with Contextualized Stopwords

4Citations
Citations of this article
26Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The use of stopwords has been thoroughly studied in traditional Information Retrieval systems, but remains unexplored in the context of neural models. Neural re-ranking models take the full text of both the query and document into account. Naturally, removing tokens that do not carry relevance information provides us with an opportunity to improve the effectiveness by reducing noise and lower document representation caching-storage requirements. In this work we propose a novel contextualized stopword detection mechanism for neural re-ranking models. This mechanism consists of training a sparse vector in order to filter out document tokens from the ranking decision. This vector is learned end-to-end based on the contextualized document representations, allowing the model to filter terms on a per occurrence basis. This leads to a more explainable model, as it reduces noise. We integrate our component into the state-of-the-art interaction-based TK neural re-ranking model. Our experiments on the MS MARCO passage collection and queries from the TREC 2019 Deep Learning Track show that filtering out traditional stopwords prior to the neural model reduces its effectiveness, while learning to filter out contextualized representations improves it.

References Powered by Scopus

Understanding inverse document frequency: On theoretical arguments for IDF

1110Citations
N/AReaders
Get full text

A deep relevance matching model for Ad-hoc retrieval

677Citations
N/AReaders
Get full text

End-To-end neural ad-hoc ranking with kernel pooling

459Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Perspectives of non-expert users on cyber security and privacy: An analysis of online discussions on twitter

18Citations
N/AReaders
Get full text

Introducing Neural Bag of Whole-Words with ColBERTer: Contextualized Late Interactions using Enhanced Reduction

17Citations
N/AReaders
Get full text

On Approximate Nearest Neighbour Selection for Multi-Stage Dense Retrieval

12Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Hofstätter, S., Lipani, A., Zlabinger, M., & Hanbury, A. (2020). Learning to Re-Rank with Contextualized Stopwords. In International Conference on Information and Knowledge Management, Proceedings (pp. 2057–2060). Association for Computing Machinery. https://doi.org/10.1145/3340531.3412079

Readers over time

‘20‘21‘22‘23‘25036912

Readers' Seniority

Tooltip

Researcher 14

70%

PhD / Post grad / Masters / Doc 5

25%

Lecturer / Post doc 1

5%

Readers' Discipline

Tooltip

Computer Science 19

86%

Biochemistry, Genetics and Molecular Bi... 1

5%

Business, Management and Accounting 1

5%

Linguistics 1

5%

Save time finding and organizing research with Mendeley

Sign up for free
0