Training efficient tree-based models for document ranking

Nima Asadi; Jimmy Lin

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 7814 LNCS 146-157

DOI: 10.1007/978-3-642-36973-5_13

24 Citations

26 Readers

Abstract

Gradient-boosted regression trees (GBRTs) have proven to be an effective solution to the learning-to-rank problem. This work proposes and evaluates techniques for training GBRTs that have efficient runtime characteristics. Our approach is based on the simple idea that compact, shallow, and balanced trees yield faster predictions: thus, it makes sense to incorporate some notion of execution cost during training to "encourage" trees with these topological characteristics. We propose two strategies for accomplishing this: the first, by directly modifying the node splitting criterion during tree induction, and the second, by stagewise tree pruning. Experiments on a standard learning-to-rank dataset show that the pruning approach is superior; one balanced setting yields an approximately 40% decrease in prediction latency with minimal reduction in output quality as measured by NDCG. © 2013 Springer-Verlag.
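The first strategy in the abstract, folding execution cost into the node splitting criterion, can be illustrated with a small sketch. The abstract does not give the authors' actual criterion, so everything here is an assumption made for illustration: the function names, the squared-error gain, and the linear depth penalty are all hypothetical and are not taken from the paper.

```python
from statistics import fmean


def sse(values):
    """Sum of squared errors around the mean (0 for an empty list)."""
    if not values:
        return 0.0
    m = fmean(values)
    return sum((v - m) ** 2 for v in values)


def split_gain(xs, ys, threshold):
    """Reduction in squared error from splitting on xs <= threshold."""
    left = [y for x, y in zip(xs, ys) if x <= threshold]
    right = [y for x, y in zip(xs, ys) if x > threshold]
    if not left or not right:
        return 0.0
    return sse(ys) - sse(left) - sse(right)


def cost_aware_gain(gain, depth, penalty):
    """Hypothetical criterion: charge each split a cost that grows with depth.

    This linear penalty is an assumption for illustration, not the
    criterion used in the paper.
    """
    return gain - penalty * (depth + 1)


def should_split(xs, ys, depth, penalty):
    """Return the best threshold, or None if no split beats its cost."""
    best_t, best_score = None, 0.0
    # Skip the largest value: splitting there leaves the right side empty.
    for t in sorted(set(xs))[:-1]:
        score = cost_aware_gain(split_gain(xs, ys, t), depth, penalty)
        if score > best_score:
            best_t, best_score = t, score
    return best_t
```

Under this sketch, a split that is worthwhile near the root can be rejected deeper in the tree, which nudges induction toward shallower trees. For example, `should_split([1, 2, 3, 4], [0, 0, 1, 1], depth=0, penalty=0.1)` picks the threshold 2, while the same data at `depth=5` with `penalty=0.3` yields no split.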

Cite

Asadi, N., & Lin, J. (2013). Training efficient tree-based models for document ranking. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7814 LNCS, pp. 146–157). https://doi.org/10.1007/978-3-642-36973-5_13
