It is shown in [4] that Rocchio's similarity-based relevance feedback algorithm makes Ω(n) mistakes in searching for a collection of documents represented by a monotone disjunction of at most k relevant features (or terms) over the n-dimensional binary vector space {0, 1}n. In practice, Rocchio's algorithm often uses a fixed query updating factor and a fixed classification threshold. When this is the case, we strengthen the work in [4] in this paper and prove that Rocchio's algorithm makes Ω(k(n - k)) mistakes in searching for the same collection of documents over the binary vector space {0, 1}n. A quadratic lower bound is obtained when k is proportional to n. An O(k(n - k)2) upper bound is also obtained. © Springer-Verlag Berlin Heidelberg 2005.
CITATION STYLE
Chen, Z., & Fu, B. (2005). A quadratic lower bound for Rocchio’s similarity-based relevance feedback algorithm. In Lecture Notes in Computer Science (Vol. 3595, pp. 955–964). Springer Verlag. https://doi.org/10.1007/11533719_96
Mendeley helps you to discover research relevant for your work.