Q-learning based failure detection and self-recovery algorithm for multi-robot domains


Abstract

Task allocation is an essential part of multi-robot coordination research and plays a significant role in achieving the desired system performance. Uncertainties inherent in the working environments of multi-robot systems are the major hurdle to perfect coordination. In learning-based task allocation approaches, robots first learn about their working environment and then draw on that experience in future task allocation. These approaches provide useful solutions as long as environmental conditions remain unchanged. If the environment's characteristics change permanently, or some failure occurs in the multi-robot system, as can happen in disaster response, the previously learned information becomes invalid. At this point, the most important task is to detect the failure and return the system to its initial learning state. For this purpose, a Q-learning based failure detection and self-recovery algorithm is proposed in this study. With this approach, the multi-robot system checks whether these variations are permanent and, if required, returns the system to the learning state. It thus provides a dynamic task allocation procedure that offers significant advantages in unforeseen situations. The experimental results verify that the proposed algorithm offers efficient solutions to the multi-robot task allocation problem even in cases of systemic failure.
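The abstract gives no algorithmic detail, so the following is only a minimal sketch of the general idea (learn, exploit, detect a persistent change, return to the learning state), not the paper's algorithm. It assumes a single-state simplification of Q-learning (update Q ← Q + α(r − Q), with no next-state term), and it assumes a change counts as "permanent" when the reward prediction error stays large for several consecutive exploitation steps. The class name `RecoveringQLearner` and the parameters `threshold`, `patience`, and the epsilon schedule are hypothetical and chosen purely for illustration.

```python
import random


class RecoveringQLearner:
    """Illustrative single-state Q-learner for task selection.

    Assumption (not from the paper): a failure is declared when the absolute
    reward prediction error exceeds `threshold` for `patience` consecutive
    updates made after exploration has decayed to its minimum.
    """

    def __init__(self, n_tasks, alpha=0.2, eps_start=1.0, eps_min=0.05,
                 eps_decay=0.9, threshold=0.3, patience=3, seed=0):
        self.n_tasks = n_tasks
        self.alpha = alpha            # learning rate
        self.eps_start = eps_start    # exploration rate in the learning state
        self.eps_min = eps_min        # exploration rate once learning settles
        self.eps_decay = eps_decay    # multiplicative decay per update
        self.threshold = threshold    # error size that counts as a surprise
        self.patience = patience      # consecutive surprises => permanent
        self.rng = random.Random(seed)
        self.recoveries = 0
        self._enter_learning_state()

    def _enter_learning_state(self):
        # Assumed meaning of "recovery": forget learned values, explore again.
        self.q = [0.0] * self.n_tasks
        self.epsilon = self.eps_start
        self.surprises = 0

    def choose_task(self):
        # Epsilon-greedy selection over the task values.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_tasks)
        return max(range(self.n_tasks), key=lambda t: self.q[t])

    def update(self, task, reward):
        """Apply one learning step; return True if a recovery was triggered."""
        error = reward - self.q[task]
        self.q[task] += self.alpha * error
        exploiting = self.epsilon <= self.eps_min
        self.epsilon = max(self.eps_min, self.epsilon * self.eps_decay)
        if not exploiting:
            return False  # large errors are expected while still learning
        # Count consecutive surprises; one isolated disturbance resets to 0.
        self.surprises = self.surprises + 1 if abs(error) > self.threshold else 0
        if self.surprises >= self.patience:
            self._enter_learning_state()
            self.recoveries += 1
            return True
        return False
```

In use, a robot would call `choose_task()`, execute the task, and pass the observed reward to `update()`. A single bad outcome is absorbed by the ordinary update, while a run of them resets the value table and the exploration rate. Counting consecutive surprises is a deliberately simple persistence test: an exploratory step in the middle of a run resets the count, so a windowed or statistical test may be more robust in practice.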


Cite

Ezercan Kayir, H. H. (2019). Q-learning based failure detection and self-recovery algorithm for multi-robot domains. Elektronika Ir Elektrotechnika, 25(1), 3–7. https://doi.org/10.5755/j01.eie.25.1.22728
