A framework for selecting deep learning hyper-parameters


Abstract

Recent research has found that deep learning architectures show significant improvements over traditional shallow algorithms when mining high-dimensional datasets. However, the combined choices of algorithm, hyper-parameter settings, and the number of hidden layers and of nodes per layer mean that identifying an optimal configuration can be a lengthy process. Our work provides a framework for building deep learning architectures via a stepwise approach, together with an evaluation methodology to quickly identify poorly performing architectural configurations. Using a high-dimensional dataset, we illustrate how different architectures perform and how the configuration of one algorithm can provide input for fine-tuning more complex models.
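The stepwise idea described above, evaluating cheap, shallow configurations first, pruning poor performers early, and carrying the best settings forward to seed deeper models, can be sketched as follows. This is a minimal illustrative sketch only; all function names, the scoring function, and the pruning thresholds are assumptions, not the authors' actual method or code.

```python
# Hypothetical sketch of a stepwise hyper-parameter search with early
# pruning. A real pipeline would train a network per configuration and
# score it on a validation set; here `evaluate` is a stand-in.

def evaluate(config):
    # Illustrative stand-in scoring function (NOT from the paper):
    # favours a learning rate near 0.01 and penalises extra depth.
    return 1.0 / (1.0 + abs(config["lr"] - 0.01)) - 0.05 * config["layers"]

def stepwise_search(learning_rates, max_layers, cutoff=0.5):
    """Grow depth one layer at a time, pruning configurations whose
    score falls below `cutoff` and carrying only the best forward."""
    survivors = [{"lr": lr, "layers": 1} for lr in learning_rates]
    best = None
    for depth in range(1, max_layers + 1):
        scored = []
        for cfg in survivors:
            cfg = dict(cfg, layers=depth)
            score = evaluate(cfg)
            if score >= cutoff:  # discard poorly performing configs early
                scored.append((score, cfg))
        if not scored:
            break
        scored.sort(key=lambda s: s[0], reverse=True)
        best = scored[0]
        # carry only the top half forward into the next (deeper) step
        survivors = [cfg for _, cfg in scored[: max(1, len(scored) // 2)]]
    return best

best = stepwise_search([0.001, 0.01, 0.1], max_layers=3)
```

The design point the sketch illustrates is that each step reuses the winning settings from the previous, simpler step, so the expensive deep configurations are only ever evaluated with already-promising hyper-parameters.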

Citation (APA)

O’Donoghue, J., & Roantree, M. (2015). A framework for selecting deep learning hyper-parameters. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9147, pp. 120–132). Springer Verlag. https://doi.org/10.1007/978-3-319-20424-6_12
