Abstract
Variational quantum circuit is proposed for applications in supervised learning and reinforcement learning to harness potential quantum advantage. However, many practical applications in robotics and time-series analysis are in partially observable environment. In this work, we propose an algorithm based on variational quantum circuits for reinforcement learning under partially observable environment. Simulations suggest learning advantage over several classical counterparts. The learned parameters are then tested on IBMQ systems to demonstrate the applicability of our approach for real-machine-based predictions.
Cite
CITATION STYLE
Kimura, T., Shiba, K., Chen, C. C., Sogabe, M., Sakamoto, K., & Sogabe, T. (2021). Variational Quantum Circuit-Based Reinforcement Learning for POMDP and Experimental Implementation. Mathematical Problems in Engineering, 2021. https://doi.org/10.1155/2021/3511029
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.