Natural gradient via optimal transport

36Citations
Citations of this article
51Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We study a natural Wasserstein gradient flow on manifolds of probability distributions with discrete sample spaces. We derive the Riemannian structure for the probability simplex from the dynamical formulation of the Wasserstein distance on a weighted graph. We pull back the geometric structure to the parameter space of any given probability model, which allows us to define a natural gradient flow there. In contrast to the natural Fisher–Rao gradient, the natural Wasserstein gradient incorporates a ground metric on sample space. We illustrate the analysis of elementary exponential family examples and demonstrate an application of the Wasserstein natural gradient to maximum likelihood estimation.

Cite

CITATION STYLE

APA

Li, W., & Montúfar, G. (2018). Natural gradient via optimal transport. Information Geometry, 1(2), 181–214. https://doi.org/10.1007/s41884-018-0015-3

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free