Classification is an important machine learning problem, and decision tree construction algorithms are an important class of solutions to this problem. RainForest is a scalable way to implement decision tree construction algorithms. It consists of several algorithms, of which the best one is a hybrid between a traditional recursive implementation and an iterative implementation which uses more memory but involves less write operations. We propose an optimized algorithm inspired by RainForest. By using a more sophisticated switching criterion between the two algorithms, we are able to get a performance gain even when all statistical information fits in memory. Evaluations show that our method can achieve a performance boost of 2.8 times in average than the traditional recursive implementation.
CITATION STYLE
Yang, Y., & Chen, W. (2016). Taiga: Performance optimization of the C4.5 decision tree construction algorithm. Tsinghua Science and Technology, 21(4), 415–425. https://doi.org/10.1109/TST.2016.7536719
Mendeley helps you to discover research relevant for your work.