Task allocation is an essential part of multi-robot coordination research and plays a significant role in achieving the desired system performance. Uncertainties inherent in the working environments of multi-robot systems are the major obstacle to perfect coordination. In learning-based task allocation approaches, robots first learn about their working environment and then exploit this experience in future task allocation. These approaches provide useful solutions as long as environmental conditions remain unchanged. If the environment changes permanently or the multi-robot system suffers a failure, as commonly happens in disaster response scenarios, the previously learned information becomes invalid. At that point, the critical task is to detect the failure and return the system to its initial learning state. For this purpose, a Q-learning based failure detection and self-recovery algorithm is proposed in this study. With this approach, the multi-robot system checks whether observed variations are permanent and, if so, returns itself to the learning state. The result is a dynamic task allocation procedure with significant advantages against unforeseen situations. Experimental results verify that the proposed algorithm offers efficient solutions to the multi-robot task allocation problem even in the presence of systemic failures.
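To make the general idea concrete, the following is a minimal Python sketch of how such a mechanism could look: a tabular Q-learning task allocator that monitors its recent rewards against a baseline recorded after learning converges and, when the degradation persists rather than being transient noise, discards its learned Q-values and raises exploration to return to the learning state. The class name, thresholds, and reset policy are illustrative assumptions, not the paper's exact algorithm.

```python
# Illustrative sketch only: names, thresholds, and the reset policy are assumptions
# for illustration, not the author's published algorithm.
import random
from collections import defaultdict, deque

class SelfRecoveringAllocator:
    def __init__(self, tasks, alpha=0.3, gamma=0.9, eps_learning=0.4,
                 eps_exploit=0.05, window=50, drop_ratio=0.6, patience=3):
        self.tasks = list(tasks)
        self.alpha, self.gamma = alpha, gamma
        self.eps_learning, self.eps_exploit = eps_learning, eps_exploit
        self.eps = eps_learning                      # start in the learning state
        self.q = defaultdict(float)                  # Q[(state, task)] -> value
        self.rewards = deque(maxlen=window)          # recent rewards for monitoring
        self.baseline = None                         # average reward after learning
        self.drop_ratio, self.patience = drop_ratio, patience
        self.low_readings = 0                        # consecutive degraded readings

    def choose_task(self, state):
        """Epsilon-greedy task selection."""
        if random.random() < self.eps:
            return random.choice(self.tasks)
        return max(self.tasks, key=lambda t: self.q[(state, t)])

    def update(self, state, task, reward, next_state):
        """Standard Q-learning update plus failure monitoring."""
        best_next = max(self.q[(next_state, t)] for t in self.tasks)
        td_target = reward + self.gamma * best_next
        self.q[(state, task)] += self.alpha * (td_target - self.q[(state, task)])
        self.rewards.append(reward)
        self._monitor()

    def freeze_baseline(self):
        """Call once learning has converged: store a baseline, switch to exploitation."""
        self.baseline = sum(self.rewards) / max(len(self.rewards), 1)
        self.eps = self.eps_exploit

    def _monitor(self):
        """Detect a persistent drop below the learned baseline and self-recover."""
        if self.baseline is None or len(self.rewards) < self.rewards.maxlen:
            return
        avg = sum(self.rewards) / len(self.rewards)
        if avg < self.drop_ratio * self.baseline:
            self.low_readings += 1
        else:
            self.low_readings = 0                    # transient noise, ignore
        if self.low_readings >= self.patience:       # the change looks permanent
            self.q.clear()                           # discard now-invalid experience
            self.eps = self.eps_learning             # return to the learning state
            self.baseline = None
            self.low_readings = 0
```

In this sketch the "failure detection" is simply a persistent gap between current and baseline average reward, and "self-recovery" is a full reset of the learned policy; the paper's own detection criterion and recovery procedure may differ.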
Ezercan Kayir, H. H. (2019). Q-learning based failure detection and self-recovery algorithm for multi-robot domains. Elektronika Ir Elektrotechnika, 25(1), 3–7. https://doi.org/10.5755/j01.eie.25.1.22728