Learning to interact with learning agents

Abstract

AI and machine learning methods are increasingly interacting with and seeking information from people, robots, and other learning agents. Consequently, the learning dynamics of these agents create fundamentally new challenges for existing methods. Motivated by the application of learning to offer personalized deals to users, we highlight these challenges by studying a variant of the framework of “online learning using expert advice with bandit feedback”. In our setting, we consider each expert as a learning agent, seeking to more accurately reflect real-world applications. The bandit feedback leads to additional challenges in this setting: at time t, only the expert i_t that has been selected by the central algorithm (forecaster) receives feedback from the environment and gets to learn at this time. A natural question to ask is whether it is possible to be competitive with the best expert j* had it seen all the feedback, i.e., competitive with the policy of always selecting expert j*. We prove the following hardness result: without any coordination between the forecaster and the experts, it is impossible to design a forecaster achieving no-regret guarantees. We then consider a practical assumption allowing the forecaster to guide the learning process of the experts by blocking some of the feedback observed by them from the environment, i.e., preventing the selected expert i_t from learning at some time steps t. With this additional coordination power, we design our forecaster LIL that achieves no-regret guarantees, and we provide regret bounds dependent on the learning dynamics of the best expert j*.
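The feedback structure described in the abstract can be sketched as a short simulation loop. This is only an illustration of the protocol (who gets to learn at each step), not the LIL forecaster. The `MeanExpert` class, the uniform-random selection rule, the alternate-step blocking rule, and the reward probabilities are all hypothetical placeholders that do not come from the paper.

```python
import random


class MeanExpert:
    """Hypothetical learning expert: tracks the empirical mean reward per action."""

    def __init__(self, n_actions):
        self.counts = [0] * n_actions
        self.means = [0.0] * n_actions

    def recommend(self):
        # Try every action once, then recommend the best empirical mean.
        for action, count in enumerate(self.counts):
            if count == 0:
                return action
        return max(range(len(self.means)), key=self.means.__getitem__)

    def learn(self, action, reward):
        # Incremental update of the running mean for the played action.
        self.counts[action] += 1
        self.means[action] += (reward - self.means[action]) / self.counts[action]


def run_protocol(experts, reward_fn, select_fn, block_fn, horizon):
    """Run the interaction loop; return (total reward, per-expert update counts)."""
    total = 0.0
    updates = [0] * len(experts)
    for t in range(horizon):
        i_t = select_fn(t)                  # forecaster picks one expert
        action = experts[i_t].recommend()   # only that expert acts
        reward = reward_fn(action)          # bandit feedback for that action only
        total += reward
        if not block_fn(t, i_t):            # forecaster may block the feedback
            experts[i_t].learn(action, reward)
            updates[i_t] += 1
    return total, updates


# Placeholder environment and forecaster rules (assumptions for illustration).
rng = random.Random(0)
success_prob = [0.2, 0.5, 0.8]
experts = [MeanExpert(len(success_prob)) for _ in range(2)]
total, updates = run_protocol(
    experts,
    reward_fn=lambda a: 1.0 if rng.random() < success_prob[a] else 0.0,
    select_fn=lambda t: rng.randrange(len(experts)),  # uniform choice, not LIL
    block_fn=lambda t, i: t % 2 == 1,                 # block feedback on odd steps
    horizon=100,
)
```

In this sketch an expert that is not selected never updates, and a selected expert updates only when the forecaster does not block the feedback, which is the coordination mechanism the abstract refers to.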

Cite

Singla, A., Hassani, H., & Krause, A. (2018). Learning to interact with learning agents. In 32nd AAAI Conference on Artificial Intelligence, AAAI 2018 (pp. 4083–4090). AAAI Press. https://doi.org/10.1609/aaai.v32i1.11739
