Learning deep architectures for AI

Yoshua Bengio

Journal Article

Foundations and Trends in Machine Learning (2009) 2(1) 1-27

DOI: 10.1561/2200000006

6.4k Citations

9.6k Readers

Abstract

Theoretical results suggest that in order to learn the kind of complicated functions that can represent high-level abstractions (e.g., in vision, language, and other AI-level tasks), one may need deep architectures. Deep architectures are composed of multiple levels of non-linear operations, such as in neural nets with many hidden layers or in complicated propositional formulae re-using many sub-formulae. Searching the parameter space of deep architectures is a difficult task, but learning algorithms such as those for Deep Belief Networks have recently been proposed to tackle this problem with notable success, beating the state-of-the-art in certain areas. This monograph discusses the motivations and principles regarding learning algorithms for deep architectures, in particular those exploiting as building blocks unsupervised learning of single-layer models such as Restricted Boltzmann Machines, used to construct deeper models such as Deep Belief Networks. © 2009 Y. Bengio.

Cite

Bengio, Y. (2009). Learning deep architectures for AI. Foundations and Trends in Machine Learning, 2(1), 1–27. https://doi.org/10.1561/2200000006
