Overparametrized deep networks predict well despite the lack of explicit complexity control during training, such as a regularization term. For exponential-type loss functions, we solve this puzzle by showing that gradient descent induces an effective regularization in terms of the normalized weights, which are the quantities relevant for classification.
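A minimal sketch of the phenomenon the abstract describes, not the paper's own experiments: gradient descent on a linear classifier with the exponential loss over separable data. The toy data, step size, and iteration counts below are illustrative assumptions. The unnormalized weight norm ||w|| grows without bound, yet the normalized weights w/||w|| settle into a fixed direction, which is the implicit complexity control relevant for classification.

import numpy as np

rng = np.random.default_rng(0)

# Linearly separable toy data with labels y in {-1, +1}.
n, d = 200, 2
X = rng.normal(size=(n, d))
y = np.sign(X[:, 0] + 0.5 * X[:, 1])
X += 0.5 * y[:, None] * np.array([1.0, 0.5])  # push the classes apart

w = rng.normal(size=d) * 0.01
lr = 0.1
prev_dir = w / np.linalg.norm(w)

for t in range(1, 20001):
    # Exponential loss: L(w) = mean_i exp(-y_i <w, x_i>).
    margins = y * (X @ w)
    grad = -(X * (y * np.exp(-margins))[:, None]).mean(axis=0)
    w -= lr * grad
    if t % 5000 == 0:
        direction = w / np.linalg.norm(w)
        print(f"step {t:6d}  ||w|| = {np.linalg.norm(w):8.3f}  "
              f"direction drift = {np.linalg.norm(direction - prev_dir):.2e}")
        prev_dir = direction

Running this, the printed norm keeps increasing while the direction drift shrinks toward zero: the loss itself never plateaus at a regularized minimum, but the normalized weights converge, consistent with the effective regularization of gradient descent claimed above.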
Poggio, T., Liao, Q., & Banburski, A. (2020). Complexity control by gradient descent in deep networks. Nature Communications, 11(1). https://doi.org/10.1038/s41467-020-14663-9