Large scale biomedical data analysis with tree-based automated machine learning

0Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

Tree-based Pipeline Optimization Tool (TPOT) is an automated machine learning (AutoML) system that recommends optimal pipeline for supervised learning problems by scanning data for novel features, selecting appropriate models and optimizing their parameters. However, like other AutoML systems, TPOT may reach computational resource limits when working on big data such as whole-genome expression data. We develop two novel features for TPOT, Feature Set Selector and Template, which leverage domain knowledge, greatly reduce the computational expense and flexibly extend TPOT's application to biomedical big data analysis.

Author supplied keywords

Cite

CITATION STYLE

APA

Le, T. T., Fu, W., & Moore, J. H. (2020). Large scale biomedical data analysis with tree-based automated machine learning. In GECCO 2020 Companion - Proceedings of the 2020 Genetic and Evolutionary Computation Conference Companion (pp. 21–22). Association for Computing Machinery, Inc. https://doi.org/10.1145/3377929.3397770

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free