Evolutionary reinforcement learning of dynamical large deviations

19Citations
Citations of this article
35Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We show how to bound and calculate the likelihood of dynamical large deviations using evolutionary reinforcement learning. An agent, a stochastic model, propagates a continuous-time Monte Carlo trajectory and receives a reward conditioned upon the values of certain path-extensive quantities. Evolution produces progressively fitter agents, potentially allowing the calculation of a piece of a large-deviation rate function for a particular model and path-extensive quantity. For models with small state spaces, the evolutionary process acts directly on rates, and for models with large state spaces, the process acts on the weights of a neural network that parameterizes the model's rates. This approach shows how path-extensive physics problems can be considered within a framework widely used in machine learning.

Cite

CITATION STYLE

APA

Whitelam, S., Jacobson, D., & Tamblyn, I. (2020). Evolutionary reinforcement learning of dynamical large deviations. Journal of Chemical Physics, 153(4). https://doi.org/10.1063/5.0015301

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free