Semi-subsumed events: A probabilistic semantics of the BM25 term frequency quantification

0Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Through BM25, the asymptotic term frequency quantification TF = tf/(tf+K), where is the within-document term frequency and K is a normalisation factor, became popular. This paper reports a finding regarding the meaning of the TF quantification: in the triangle of independence and subsumption, the TF quantification forms the altitude, that is, the middle between independent and subsumed events. We refer to this new assumption as semi-subsumed. While this finding of a well-defined probabilistic assumption solves the probabilistic interpretation of the BM25 TF quantification, it is also of wider impact regarding probability theory. © 2009 Springer Berlin Heidelberg.

Cite

CITATION STYLE

APA

Wu, H., & Roelleke, T. (2009). Semi-subsumed events: A probabilistic semantics of the BM25 term frequency quantification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5766 LNCS, pp. 375–379). https://doi.org/10.1007/978-3-642-04417-5_43

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free