Hyperparameter optimization with factorized multilayer perceptrons

Nicolas Schilling; Martin Wistuba; Lucas Drumond; Lars Schmidt-Thieme

Conference Proceedings, Open Access

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9285 87-103

DOI: 10.1007/978-3-319-23525-7_6

31 Citations

29 Readers

Abstract

In machine learning, hyperparameter optimization is a challenging task that is usually approached by experienced practitioners or in a computationally expensive brute-force manner such as grid search. Therefore, recent research proposes to use observed hyperparameter performance on already solved problems (i.e., data sets) in order to speed up the search for promising hyperparameter configurations in the sequential model-based optimization framework. In this paper, we propose multilayer perceptrons as surrogate models, as they are able to model highly nonlinear hyperparameter response surfaces. However, since interactions of hyperparameters, data sets and metafeatures are only implicitly learned in the subsequent layers, we improve the performance of multilayer perceptrons by means of an explicit factorization of the interaction weights and call the resulting model a factorized multilayer perceptron. Additionally, we evaluate different ways of obtaining predictive uncertainty, which is a key ingredient for a good tradeoff between exploration and exploitation. Our experimental results on two public meta data sets demonstrate the efficiency of our approach compared to a variety of published baselines. For reproduction purposes, we make our data sets and all the program code publicly available on our supplementary webpage.
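
The abstract does not spell out how the interaction weights are factorized. As a rough illustration only, the sketch below assumes a factorization-machine-style first layer, in which the weight on each pairwise input product is the inner product of two low-rank latent vectors. The function name `factorized_layer`, the array shapes, and the latent dimension `k` are hypothetical and are not taken from the paper.

```python
import numpy as np

def factorized_layer(x, W, b, V):
    """Pre-activations of one hidden layer with factorized pairwise terms.

    Assumed shapes (illustrative, not from the paper):
      x: (d,)      input vector, e.g. hyperparameters, data set indicator,
                   and metafeatures concatenated
      W: (d, h)    ordinary linear weights
      b: (h,)      biases
      V: (h, d, k) latent factors; for hidden unit j the weight on x[i]*x[l]
                   is the inner product of V[j, i] and V[j, l]
    """
    linear = x @ W + b
    # Factorization-machine identity, evaluated per hidden unit:
    #   sum_{i<l} <v_i, v_l> x_i x_l
    #     = 0.5 * sum_f [ (sum_i v_if x_i)^2 - sum_i v_if^2 x_i^2 ]
    s = np.einsum("hdk,d->hk", V, x)
    s_sq = np.einsum("hdk,d->hk", V ** 2, x ** 2)
    pairwise = 0.5 * (s ** 2 - s_sq).sum(axis=1)
    return linear + pairwise

# Small usage example with random parameters.
rng = np.random.default_rng(0)
d, h, k = 6, 4, 3
x = rng.normal(size=d)
W = rng.normal(size=(d, h))
b = rng.normal(size=h)
V = rng.normal(size=(h, d, k))
z = factorized_layer(x, W, b, V)   # shape (h,)
hidden = np.tanh(z)                # any nonlinearity could follow
```

Under this assumption, the pairwise term costs O(d·k) per hidden unit instead of O(d²), and the remaining layers could be those of an ordinary multilayer perceptron.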

Cite

Schilling, N., Wistuba, M., Drumond, L., & Schmidt-Thieme, L. (2015). Hyperparameter optimization with factorized multilayer perceptrons. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9285, pp. 87–103). Springer Verlag. https://doi.org/10.1007/978-3-319-23525-7_6
