Information Acquisition Driven by Reinforcement in Non-Deterministic Environments

Naresh Babu Bynagari; Ruhul Amin

Journal ArticleOPEN ACCESS

Information Acquisition Driven by Reinforcement in Non-Deterministic Environments

Bynagari N
Amin R

American Journal of Trade and Policy (2019) 6(3) 107-112

DOI: 10.18034/ajtp.v6i3.569

N/ACitations

12Readers

Abstract

What is the fastest way for an agent living in a non-deterministic Markov environment (NME) to learn about its statistical properties? The answer is to create "optimal" experiment sequences by carrying out action sequences that maximize expected knowledge gain. This idea is put into practice by integrating information theory and reinforcement learning techniques. Experiments demonstrate that the resulting method, reinforcement-driven information acquisition (RDIA), is substantially faster than standard random exploration for exploring particular NMEs. Exploration was studied apart from exploitation and we evaluated the performance of different reinforcement-driven information acquisition variations to that of traditional random exploration.

Cite

CITATION STYLE

APA

Bynagari, N. B., & Amin, R. (2019). Information Acquisition Driven by Reinforcement in Non-Deterministic Environments. American Journal of Trade and Policy, 6(3), 107–112. https://doi.org/10.18034/ajtp.v6i3.569

Information Acquisition Driven by Reinforcement in Non-Deterministic Environments

Abstract

Cite

Register to see more suggestions