Cooperation in wireless networks: A game-theoretic framework with reinforcement learning


Abstract

A game-theoretic framework based on the iterated prisoner's dilemma (IPD) is proposed to model the repeated, dynamic interactions of multiple source nodes communicating with multiple destinations in an ad hoc wireless network. Because nodes in such networks are autonomous, selfish and unfamiliar with other nodes' strategies, fully cooperative behaviour cannot be assumed. Reinforcement learning is therefore studied as a way to relate each source node's utility function to its previously taken actions, so that the node learns a strategy that maximises its expected future reward. In particular, a Q-learning algorithm is proposed that allows network nodes to adapt to, and play the IPD game against, opponents with a variety of known and unknown strategies. Simulation results illustrate that the proposed Q-learning algorithm allows network nodes to play optimally and achieve their maximum expected return values. © The Institution of Engineering and Technology 2014.
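To make the idea concrete, the following is a minimal sketch of tabular Q-learning in an IPD game. It is not the algorithm from the paper, whose details the abstract does not give. Everything specific here is an assumption chosen for illustration: the textbook payoff values (3, 0, 5, 1), the tit-for-tat opponent, the memory-one state (the previous pair of moves), the epsilon-greedy exploration, and the values of alpha, gamma and epsilon.

```python
import random

C, D = 0, 1  # cooperate, defect

# Assumed textbook IPD payoffs for the learning node:
# PAYOFF[my_move][opponent_move]
PAYOFF = [[3, 0],
          [5, 1]]


def tit_for_tat(agent_previous_move):
    """Hypothetical opponent: repeats the learner's previous move."""
    return agent_previous_move


def train(steps=200_000, alpha=0.1, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-learning; the state is the previous (my_move, opp_move) pair."""
    rng = random.Random(seed)
    q = {(a, b): [0.0, 0.0] for a in (C, D) for b in (C, D)}
    state = (C, C)  # assume both nodes cooperated before the first round
    for _ in range(steps):
        # Epsilon-greedy action selection (ties favour cooperation).
        if rng.random() < epsilon:
            action = rng.choice((C, D))
        else:
            action = C if q[state][C] >= q[state][D] else D
        opponent_move = tit_for_tat(state[0])
        reward = PAYOFF[action][opponent_move]
        next_state = (action, opponent_move)
        # Standard Q-learning update towards reward + discounted best next value.
        target = reward + gamma * max(q[next_state])
        q[state][action] += alpha * (target - q[state][action])
        state = next_state
    return q


def greedy_policy(q):
    """Map each state to the move with the highest learned Q-value."""
    return {s: ("C" if v[C] >= v[D] else "D") for s, v in q.items()}


if __name__ == "__main__":
    q_table = train()
    print(greedy_policy(q_table))
```

Under these assumed settings, sustained mutual cooperation against tit-for-tat is worth 3 / (1 - 0.9) = 30 in discounted return, which exceeds the return from any defection, so the learned greedy policy should cooperate in every state. Other opponents, payoffs or discount factors would generally lead to a different learned strategy.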

Baidas, M. W. (2014). Cooperation in wireless networks: A game-theoretic framework with reinforcement learning. IET Communications, 8(5), 740–753. https://doi.org/10.1049/iet-com.2013.0817
