Through BM25, the asymptotic term frequency quantification TF = tf/(tf+K), where is the within-document term frequency and K is a normalisation factor, became popular. This paper reports a finding regarding the meaning of the TF quantification: in the triangle of independence and subsumption, the TF quantification forms the altitude, that is, the middle between independent and subsumed events. We refer to this new assumption as semi-subsumed. While this finding of a well-defined probabilistic assumption solves the probabilistic interpretation of the BM25 TF quantification, it is also of wider impact regarding probability theory. © 2009 Springer Berlin Heidelberg.
CITATION STYLE
Wu, H., & Roelleke, T. (2009). Semi-subsumed events: A probabilistic semantics of the BM25 term frequency quantification. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5766 LNCS, pp. 375–379). https://doi.org/10.1007/978-3-642-04417-5_43
Mendeley helps you to discover research relevant for your work.