Tree-based Pipeline Optimization Tool (TPOT) is an automated machine learning (AutoML) system that recommends optimal pipeline for supervised learning problems by scanning data for novel features, selecting appropriate models and optimizing their parameters. However, like other AutoML systems, TPOT may reach computational resource limits when working on big data such as whole-genome expression data. We develop two novel features for TPOT, Feature Set Selector and Template, which leverage domain knowledge, greatly reduce the computational expense and flexibly extend TPOT's application to biomedical big data analysis.
CITATION STYLE
Le, T. T., Fu, W., & Moore, J. H. (2020). Large scale biomedical data analysis with tree-based automated machine learning. In GECCO 2020 Companion - Proceedings of the 2020 Genetic and Evolutionary Computation Conference Companion (pp. 21–22). Association for Computing Machinery, Inc. https://doi.org/10.1145/3377929.3397770
Mendeley helps you to discover research relevant for your work.