Designing Sun–Earth L2 Halo Orbit Stationkeeping Maneuvers via Reinforcement Learning

21Citations
Citations of this article
14Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Reinforcement learning (RL) is used to design impulsive stationkeeping maneuvers for a spacecraft operating near an L2 quasi-halo trajectory in a Sun–Earth–Moon point mass ephemeris model with solar radiation pressure. This scenario is translated into an RL problem that reflects the desired stationkeeping goals, variables, and dynamical model. An algorithm from proximal policy optimization is used to train a policy that generates stationkeeping maneuvers while transfer learning is used to reduce the computational time required for training. The trained policy successfully generates stationkeeping maneuvers that result in boundedness to the vicinity of the selected reference trajectory with low total maneuver requirements, producing comparable results to a traditionally formulated constrained optimization scheme.

Cite

CITATION STYLE

APA

Bonasera, S., Bosanac, N., Sullivan, C. J., Elliott, I., Ahmed, N., & McMahon, J. W. (2023). Designing Sun–Earth L2 Halo Orbit Stationkeeping Maneuvers via Reinforcement Learning. Journal of Guidance, Control, and Dynamics, 46(2), 301–311. https://doi.org/10.2514/1.G006783

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free