Goal directed design of rewards and training features for self-learning agents in a human-autonomy-teaming environment

Simon Schwerd; Sebastian Lindner; Axel Schulte

Conference Proceedings

Advances in Intelligent Systems and Computing (2020) 1131 AISC 925-931

DOI: 10.1007/978-3-030-39512-4_141

Citations: 1

Readers: 11

Abstract

For architects of reinforcement learning (RL) agents in real-world applications, the design of a suitable training environment is challenging. Feature engineering and reward function design are mainly guided by human intuition and domain knowledge. We propose the application of a goal-directed task analysis (GDTA) to structure knowledge about an application domain in order to guide the design of an RL agent embedded in a complex environment. Results from the task analysis can be leveraged for feature selection and reward function design. We showcase this approach in a human-autonomy-teaming application for military airborne operations.

Citation (APA)

Schwerd, S., Lindner, S., & Schulte, A. (2020). Goal directed design of rewards and training features for self-learning agents in a human-autonomy-teaming environment. In Advances in Intelligent Systems and Computing (Vol. 1131 AISC, pp. 925–931). Springer. https://doi.org/10.1007/978-3-030-39512-4_141
