Abstract
Regression trees and their ensemble methods are popular methods for nonparametric regression: they combine strong predictive performance with interpretable estimators. To improve their utility for locally smooth response surfaces, we study regression trees and random forests with linear aggregation functions. We introduce a new algorithm that finds the best axis-aligned split to fit linear aggregation functions on the corresponding nodes, and we offer a quasilinear time implementation. We demonstrate the algorithm’s favorable performance on real-world benchmarks and in an extensive simulation study, and we demonstrate its improved interpretability using a large get-out-the-vote experiment. We provide an open-source software package that implements several tree-based estimators with linear aggregation functions. Supplementary materials for this article are available online.
Author supplied keywords
Cite
CITATION STYLE
Künzel, S. R., Saarinen, T. F., Liu, E. W., & Sekhon, J. S. (2022). Linear Aggregation in Tree-Based Estimators. Journal of Computational and Graphical Statistics, 31(3), 917–934. https://doi.org/10.1080/10618600.2022.2026780
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.