Deep vs. deep Bayesian: Faster reinforcement learning on a multi-robot competitive experiment

0Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Deep Learning experiments commonly require hundreds of trials to properly train neural networks, often labeled as Big Data, while Bayesian learning leverages scarce data points to infer next iterations, also known as Micro Data. Deep Bayesian Learning combines the complexity from multi-layered neural networks to probabilistic inferences, and it allows a robot to learn good policies within few trials in the real world. In here we propose, for the first time, an application of Deep Bayesian Reinforcement Learning (RL) on a real-world multi-robot confrontation game, and compare the algorithm with a model-free Deep RL algorithm, Deep Q-Learning. Our experiments show that DBRL significantly outperforms DRL in learning efficiency and scalability. The results of this work point to the advantages of Deep Bayesian approaches in bypassing the Reality Gap and sim-to-real implementations, as the time taken for real-world learning can quickly outperform data-intensive Deep alternatives.

Cite

CITATION STYLE

APA

Huang, J., Giardina, F., & Rosendo, A. (2021). Deep vs. deep Bayesian: Faster reinforcement learning on a multi-robot competitive experiment. In Proceedings of the 18th International Conference on Informatics in Control, Automation and Robotics, ICINCO 2021 (pp. 501–506). SciTePress. https://doi.org/10.5220/0010601905010506

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free