Rule-based active sampling for learning to rank

Rodrigo Silva; Marcos A. Gonçalves; Adriano Veloso

Conference ProceedingsOPEN ACCESS

Rule-based active sampling for learning to rank

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6913 LNAI(PART 3) 240-255

DOI: 10.1007/978-3-642-23808-6_16

24Citations

16Readers

Abstract

Learning to rank (L2R) algorithms rely on a labeled training set to generate a ranking model that can be later used to rank new query results. Producing these labeled training sets is usually very costly as it requires human annotators to assess the relevance or order the elements in the training set. Recently, active learning alternatives have been proposed to reduce the labeling effort by selectively sampling an unlabeled set. In this paper we propose a novel rule-based active sampling method for Learning to Rank. Our method actively samples an unlabeled set, selecting new documents to be labeled based on how many relevance inference rules they generate given the previously selected and labeled examples. The smaller the number of generated rules, the more dissimilar and more "informative" is a document with regard to the current state of the labeled set. Differently from previous solutions, our algorithm does not rely on an initial training seed and can be directly applied to an unlabeled dataset. Also in contrast to previous work, we have a clear stop criterion and do not need to empirically discover the best configuration by running a number of iterations on the validation or test sets. These characteristics make our algorithm highly practical. We demonstrate the effectiveness of our active sampling method on several benchmarking datasets, showing that a significant reduction in training size is possible. Our method selects as little as 1.1% and at most 2.2% of the original training sets, while providing competitive results when compared to state-of-the-art supervised L2R algorithms that use the complete training sets. © 2011 Springer-Verlag.

Cite

CITATION STYLE

APA

Silva, R., Gonçalves, M. A., & Veloso, A. (2011). Rule-based active sampling for learning to rank. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6913 LNAI, pp. 240–255). https://doi.org/10.1007/978-3-642-23808-6_16

Rule-based active sampling for learning to rank

Abstract

Cite

Register to see more suggestions