Bias Optimality

Mark E. Lewis; Martin L. Puterman

Book Chapter

DOI: 10.1007/978-1-4615-0805-2_3

Abstract

The long-run average reward, or gain, has received considerable attention in the literature as an optimality criterion. For many practical models, however, the gain has the undesirable property of being underselective: there may be several gain optimal policies. After finding the set of policies that achieve the primary objective of maximizing the long-run average reward, one might search within that set for a policy that maximizes the "short-run" reward. This reward, called the bias, aids in distinguishing among multiple gain optimal policies. This chapter establishes the usefulness of the bias for that purpose, shows how to compute it, and demonstrates the implicit discounting captured by the bias on recurrent states.
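As an illustration of the underselectiveness described in the abstract, the following minimal sketch evaluates two policies that share the same gain but differ in bias. It is not taken from the chapter: the two-state model, the reward values, and the helper name `gain_and_bias` are hypothetical. The sketch assumes an aperiodic chain under each fixed policy, so that the expected reward at time t converges and the bias can be approximated by the truncated sum of (expected reward at time t minus gain).

```python
def gain_and_bias(P, r, horizon=200):
    """Approximate the gain and bias of a fixed policy.

    P is the transition matrix under the policy (list of rows) and r the
    one-step reward vector. Assumes an aperiodic chain, so that
    u_t(s) = E_s[r(X_t)] converges as t grows. The horizon is an
    illustrative truncation point, not a value from the chapter.
    """
    n = len(r)
    u = list(r)              # u_0 = r
    history = [u]
    for _ in range(horizon):
        # u_{t+1} = P u_t, the expected reward one step later
        u = [sum(P[s][j] * u[j] for j in range(n)) for s in range(n)]
        history.append(u)
    g = history[-1]          # gain: limiting expected one-step reward
    # bias: accumulated deviation of expected reward from the gain
    h = [sum(step[s] - g[s] for step in history) for s in range(n)]
    return g, h


# Hypothetical model: state 1 is absorbing and pays 1 per step.
# In state 0, either action moves to state 1; action "a" pays 0
# and action "b" pays 3 on the way.
P = [[0.0, 1.0],
     [0.0, 1.0]]
r_a = [0.0, 1.0]
r_b = [3.0, 1.0]

gain_a, bias_a = gain_and_bias(P, r_a)
gain_b, bias_b = gain_and_bias(P, r_b)

print("policy a: gain", gain_a, "bias", bias_a)
print("policy b: gain", gain_b, "bias", bias_b)
```

Both policies earn the same long-run average reward, so the gain criterion cannot separate them. The bias in state 0 is larger under action "b", which reflects the extra reward collected before the process settles into its long-run behavior.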

Citation (APA)

Lewis, M. E., & Puterman, M. L. (2002). Bias Optimality (pp. 89–111). https://doi.org/10.1007/978-1-4615-0805-2_3
