Learning payoff functions in infinite games

Yevgeniy Vorobeychik; Michael P. Wellman; Satinder Singh

Journal ArticleOPEN ACCESS

Learning payoff functions in infinite games

Machine Learning (2007) 67(1-2) 145-168

DOI: 10.1007/s10994-007-0715-8

51Citations

51Readers

Get full text

Abstract

We consider a class of games with real-valued strategies and payoff information available only in the form of data from a given sample of strategy profiles. Solving such games with respect to the underlying strategy space requires generalizing from the data to a complete payoff-function representation. We address payoff-function learning as a standard regression problem, with provision for capturing known structure (e.g., symmetry) in the multiagent environment. To measure learning performance, we consider the relative utility of prescribed strategies, rather than the accuracy of payoff functions per se. We demonstrate our approach and evaluate its effectiveness on two examples: a two-player version of the first-price sealed-bid auction (with known analytical form), and a five-player market-based scheduling game (with no known solution). Additionally, we explore the efficacy of using relative utility of strategies as a target of supervised learning and as a learning model selector. Our experiments demonstrate its effectiveness in the former case, though not in the latter. © Springer Science + Business Media, LLC 2007.

Author supplied keywords

Cite

CITATION STYLE

APA

Vorobeychik, Y., Wellman, M. P., & Singh, S. (2007). Learning payoff functions in infinite games. Machine Learning, 67(1–2), 145–168. https://doi.org/10.1007/s10994-007-0715-8

Learning payoff functions in infinite games

Abstract

Author supplied keywords

Cite

Register to see more suggestions