Adaptive Cyber Defense Technique Based on Multiagent Reinforcement Learning Strategies

3Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.

Abstract

The static nature of cyber defense systems gives attackers a sufficient amount of time to explore and further exploit the vulnerabilities of information technology systems. In this paper, we investigate a problem where multiagent systems sensing and acting in an environment contribute to adaptive cyber defense. We present a learning strategy that enables multiple agents to learn optimal poli-cies using multiagent reinforcement learning (MARL). Our proposed approach is inspired by the multiarmed bandits (MAB) learning technique for multiple agents to cooperate in decision making or to work independently. We study a MAB approach in which defenders visit a system multiple times in an alternating fash-ion to maximize their rewards and protect their system. We find that this game can be modeled from an individual player’s perspective as a restless MAB problem. We discover further results when the MAB takes the form of a pure birth process, such as a myopic optimal policy, as well as providing environments that offer the necessary incentives required for cooperation in multiplayer projects.

Cite

CITATION STYLE

APA

Alshamrani, A., & Alshahrani, A. (2023). Adaptive Cyber Defense Technique Based on Multiagent Reinforcement Learning Strategies. Intelligent Automation and Soft Computing, 36(3), 2757–2771. https://doi.org/10.32604/iasc.2023.032835

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free