Fitness Landscape Analysis of Automated Machine Learning Search Spaces

Cristiano G. Pimenta; Alex G.C. de Sá; Gabriela Ochoa; Gisele L. Pappa

Conference Proceedings

Fitness Landscape Analysis of Automated Machine Learning Search Spaces

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 12102 LNCS 114-130

DOI: 10.1007/978-3-030-43680-3_8

13Citations

16Readers

Get full text

Abstract

The field of Automated Machine Learning (AutoML) has as its main goal to automate the process of creating complete Machine Learning (ML) pipelines to any dataset without requiring deep user expertise in ML. Several AutoML methods have been proposed so far, but there is not a single one that really stands out. Furthermore, there is a lack of studies on the characteristics of the fitness landscape of AutoML search spaces. Such analysis may help to understand the performance of different optimization methods for AutoML and how to improve them. This paper adapts classic fitness landscape analysis measures to the context of AutoML. This is a challenging task, as AutoML search spaces include discrete, continuous, categorical and conditional hyperparameters. We propose an ML pipeline representation, a neighborhood definition and a distance metric between pipelines, and use them in the evaluation of the fitness distance correlation (FDC) and the neutrality ratio for a given AutoML search space. Results of FDC are counter-intuitive and require a more in-depth analysis of a range of search spaces. Results of neutrality, in turn, show a strong positive correlation between the mean neutrality ratio and the fitness value.

Author supplied keywords

Cite

CITATION STYLE

APA

Pimenta, C. G., de Sá, A. G. C., Ochoa, G., & Pappa, G. L. (2020). Fitness Landscape Analysis of Automated Machine Learning Search Spaces. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 12102 LNCS, pp. 114–130). Springer. https://doi.org/10.1007/978-3-030-43680-3_8

Fitness Landscape Analysis of Automated Machine Learning Search Spaces

Abstract

Author supplied keywords

Cite

Register to see more suggestions