On the role of tracking in stationary environments

58Citations
Citations of this article
102Readers
Mendeley users who have this article in their library.
Get full text

Abstract

It is often thought that learning algorithms that track the best solution, as opposed to converging to it, are important only on nonstationary problems. We present three results suggesting that this is not so. First we illustrate in a simple concrete example, the Black and White problem, that tracking can perform better than any converging algorithm on a stationary problem. Second, we show the same point on a larger, more realistic problem, an application of temporal difference learning to computer Go. Our third result suggests that tracking in stationary problems could be important for metalearning research (e.g., learning to learn, feature selection, transfer). We apply a metalearning algorithm for step-size adaptation, IDBD (Sutton, 1992a), to the Black and White problem, showing that meta-learning has a dramatic long-term effect on performance whereas, on an analogous converging problem, meta-learning has only a small second-order effect.

Cite

CITATION STYLE

APA

Sutton, R. S., Koop, A., & Silver, D. (2007). On the role of tracking in stationary environments. In ACM International Conference Proceeding Series (Vol. 227, pp. 871–878). https://doi.org/10.1145/1273496.1273606

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free