The last-step minimax algorithm

15Citations
Citations of this article
17Readers
Mendeley users who have this article in their library.
Get full text

Abstract

We consider on-line density estimation with a parameterized density from an exponential family. In each trial t the learner predicts a parameter θt. Then it receives an instance xt chosen by the adversary and incurs loss -ln p(xt|θt) which is the negative log-likelihood of xt w.r.t. the predicted density of the learner. The performance of the learner is measured by the regret defined as the total loss of the learner minus the total loss of the best parameter chosen off-line. We develop an algorithm called the Last-step Minimax Algorithm that predicts with the minimax optimal parameter assuming that the current trial is the last one. For one-dimensional exponential families, we give an explicit form of the prediction of the Last-step Minimax Algorithm and show that its regret is O(ln T), where T is the number of trials. In particular, for Bernoulli density estimation the Last-step Minimax Algorithm is slightly better than the standard Krichevsky-Trofimov probability estimator.

Cite

CITATION STYLE

APA

Takimoto, E., & Warmuth, M. K. (2000). The last-step minimax algorithm. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 1968, pp. 279–290). Springer Verlag. https://doi.org/10.1007/3-540-40992-0_21

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free