Fast Gradient Boosting Decision Trees with Bit-Level Data Structures

Laurens Devos; Wannes Meert; Jesse Davis

Conference Proceedings

Fast Gradient Boosting Decision Trees with Bit-Level Data Structures

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 11906 LNAI 590-606

DOI: 10.1007/978-3-030-46150-8_35

9Citations

20Readers

Get full text

Abstract

A gradient boosting decision tree model is a powerful machine learning method that iteratively constructs decision trees to form an additive ensemble model. The method uses the gradient of the loss function to improve the model at each iteration step. Inspired by the database literature, we exploit bitset and bitslice data structures in order to improve the run time efficiency of learning the trees. We can use these structures in two ways. First, they can represent the input data itself. Second, they can store the discretized gradient values used by the learning algorithm to construct the trees in the boosting model. Using these bit-level data structures reduces the problem of finding the best split, which involves counting of instances and summing gradient values, to counting one-bits in bit strings. Modern CPUs can efficiently count one-bits using AVX2 SIMD instructions. Empirically, our proposed improvements can result in speed-ups of 2 to up to 10 times on datasets with a large number of categorical features without sacrificing predictive performance.

Author supplied keywords

Cite

CITATION STYLE

APA

Devos, L., Meert, W., & Davis, J. (2020). Fast Gradient Boosting Decision Trees with Bit-Level Data Structures. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11906 LNAI, pp. 590–606). Springer. https://doi.org/10.1007/978-3-030-46150-8_35

Fast Gradient Boosting Decision Trees with Bit-Level Data Structures

Abstract

Author supplied keywords

Cite

Register to see more suggestions