Potential-based algorithms in online prediction and game theory

Nicolò Cesa-Bianchi; Gábor Lugosi

Conference Proceedings

Potential-based algorithms in online prediction and game theory

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2001) 2111 48-64

DOI: 10.1007/3-540-44581-1_4

3Citations

53Readers

Get full text

Abstract

In this paper we show that several known algorithms for sequential prediction problems (including the quasi-additive family of Grove et al. and Littlestone and Warmuth’s Weighted Majority), for playing iterated games (including Freund and Schapire’s Hedge and MW, as well as the Λ-strategies of Hart and Mas-Colell), and for boosting (including AdaBoost) are special cases of a general decision strategy based on the notion of potential. By analyzing this strategy we derive known performance bounds, as well as new bounds, as simple corollaries of a single general theorem. Besides offering a new and unified view on a large family of algorithms, we establish a connection between potential-based analysis in learning and their counterparts independently developed in game theory. By exploiting this connection, we show that certain learning problems are instances of more general game-theoretic problems. In particular, we describe a notion of generalized regret and show its applications in learning theory.

Author supplied keywords

Cite

CITATION STYLE

APA

Cesa-Bianchi, N., & Lugosi, G. (2001). Potential-based algorithms in online prediction and game theory. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2111, pp. 48–64). Springer Verlag. https://doi.org/10.1007/3-540-44581-1_4

Potential-based algorithms in online prediction and game theory

Abstract

Author supplied keywords

Cite

Register to see more suggestions