The interest in AutoML search spaces has given rise to a plethora of studies conceived to better understand the characteristics of these spaces. Exploratory landscape analysis is among the most commonly investigated techniques. However, in contrast with other classical optimization problems, in AutoML defining the landscape may be as tough as characterizing it. This is because the concept of solution neighborhood is not clear, as the spaces have a high number of conditional hyperparameters and a somehow hierarchical structure. This paper looks at the impact of different solution representations and distance metrics on the definition of these spaces, and how they affect exploratory landscape analysis metrics. We conclude that these metrics are not able to deal with structured, complex spaces such as the AutoML ones, and problem-related metrics might be the way to leverage the landscape complexity.
CITATION STYLE
Teixeira, M. C., & Pappa, G. L. (2023). On the Effect of Solution Representation and Neighborhood Definition in AutoML Fitness Landscapes. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13987 LNCS, pp. 227–243). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-30035-6_15
Mendeley helps you to discover research relevant for your work.