A variational perspective on accelerated methods in optimization

Citations: 313
Mendeley readers: 281

Abstract

Accelerated gradient methods play a central role in optimization, achieving optimal rates in many settings. Although many generalizations and extensions of Nesterov's original acceleration method have been proposed, it is not yet clear what the natural scope of the acceleration concept is. In this paper, we study accelerated methods from a continuous-time perspective. We show that there is a Lagrangian functional that we call the Bregman Lagrangian, which generates a large class of accelerated methods in continuous time, including (but not limited to) accelerated gradient descent, its non-Euclidean extension, and accelerated higher-order gradient methods. We show that the continuous-time limit of all of these methods corresponds to traveling the same curve in spacetime at different speeds. From this perspective, Nesterov's technique and many of its generalizations can be viewed as a systematic way to go from the continuous-time curves generated by the Bregman Lagrangian to a family of discrete-time accelerated algorithms.
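
For orientation, the central object described in the abstract is a time-dependent Lagrangian built from a Bregman divergence. Up to notational conventions, which should be checked against the published paper, it can be written as

    \mathcal{L}(x, v, t) = e^{\alpha_t + \gamma_t} \left( D_h\!\left(x + e^{-\alpha_t} v,\, x\right) - e^{\beta_t} f(x) \right),

where f is the objective, h is a convex distance-generating function with Bregman divergence D_h(y, x) = h(y) - h(x) - \langle \nabla h(x),\, y - x \rangle, and \alpha_t, \beta_t, \gamma_t are smooth scaling functions of time. Under the paper's ideal-scaling conditions (roughly, \dot{\beta}_t \le e^{\alpha_t} and \dot{\gamma}_t = e^{\alpha_t}), solutions of the associated Euler-Lagrange equation decrease the objective at the rate O(e^{-\beta_t}); different choices of the scaling functions recover the continuous-time limits of accelerated gradient descent, its non-Euclidean (mirror-descent) extension, and accelerated higher-order methods.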

Citation (APA)

Wibisono, A., Wilson, A. C., & Jordan, M. I. (2016). A variational perspective on accelerated methods in optimization. Proceedings of the National Academy of Sciences of the United States of America, 113(47), E7351–E7358. https://doi.org/10.1073/pnas.1614734113
