Bridging the Reality Gap Between Virtual and Physical Environments Through Reinforcement Learning

Mahesh Ranaweera; Qusay H. Mahmoud

Journal Article (Open Access)

IEEE Access (2023) 11 19914-19927

DOI: 10.1109/ACCESS.2023.3249572

6 Citations

25 Readers

Abstract

Creating reinforcement learning (RL) agents that can perform tasks on real-world robotic systems remains challenging because of inconsistencies between the virtual and the real world. This 'reality gap' hinders the performance of an RL agent trained in a virtual environment. This research describes the techniques used to train the models, generate randomized environments, and design the reward function, as well as the techniques used to transfer the model to the physical environment for evaluation. For this investigation, a low-cost 3-degrees-of-freedom (DOF) Stewart platform was 3D modeled and built both virtually and physically. The goal of the 3D Stewart platform was to guide the marble toward the center and balance it there. Custom end-to-end APIs were developed to interact with the Godot game engine, manipulate physics and dynamics, control the in-game lighting, and perform environment randomizations. Two RL algorithms, Q-learning and Actor-Critic, were implemented, and their performance was evaluated with domain randomization and induced noise used to bridge the reality gap. For Q-learning, raw frames were used to make predictions, while Actor-Critic used the marble position, velocity vector, and relative position obtained by pre-processing captured frames. The experimental results show the effectiveness of domain randomization and of introducing noise during training.
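
As a rough illustration of the two ideas the abstract names, domain randomization and induced noise, the following minimal sketch samples a fresh set of environment parameters for each training episode and perturbs the marble state with Gaussian noise. It is not the authors' implementation: the parameter names, the value ranges, and the helper functions are all hypothetical, and the custom Godot APIs mentioned in the abstract are not shown.

```python
import random
from dataclasses import dataclass


@dataclass
class EpisodeParams:
    """Hypothetical per-episode simulation settings (not taken from the paper)."""
    marble_mass: float      # assumed range, in kilograms
    friction: float         # assumed surface friction coefficient
    light_intensity: float  # assumed relative in-game light level
    obs_noise_std: float    # assumed std. dev. of observation noise


def sample_episode_params(rng: random.Random) -> EpisodeParams:
    """Domain randomization: draw new physics and lighting values per episode."""
    return EpisodeParams(
        marble_mass=rng.uniform(0.005, 0.02),
        friction=rng.uniform(0.2, 0.8),
        light_intensity=rng.uniform(0.5, 1.5),
        obs_noise_std=rng.uniform(0.0, 0.02),
    )


def noisy_observation(position, velocity, params: EpisodeParams, rng: random.Random):
    """Induced noise: add zero-mean Gaussian noise to each state component."""
    state = (*position, *velocity)
    return tuple(v + rng.gauss(0.0, params.obs_noise_std) for v in state)
```

A training loop under these assumptions would call `sample_episode_params` once at the start of each episode, apply the values to the simulator, and pass every state through `noisy_observation` before it reaches the agent, so that the policy does not overfit to one exact simulated configuration.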

Citation (APA)

Ranaweera, M., & Mahmoud, Q. H. (2023). Bridging the Reality Gap Between Virtual and Physical Environments Through Reinforcement Learning. IEEE Access, 11, 19914–19927. https://doi.org/10.1109/ACCESS.2023.3249572
