Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning

Fuxiao Tan; Pengfei Yan; Xinping Guan

Conference Proceedings

Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10637 LNCS 475-483

DOI: 10.1007/978-3-319-70093-9_50

26Citations

45Readers

Get full text

Abstract

As the two hottest branches of machine learning, deep learning and reinforcement learning both play a vital role in the field of artificial intelligence. Combining deep learning with reinforcement learning, deep reinforcement learning is a method of artificial intelligence that is much closer to human learning. As one of the most basic algorithms for reinforcement learning, Q-learning is a discrete strategic learning algorithm that uses a reasonable strategy to generate an action. According to the rewards and the next state generated by the interaction of the action and the environment, optimal Q-function can be obtained. Furthermore, based on Q-learning and convolutional neural networks, the deep Q-learning with experience replay is developed in this paper. To ensure the convergence of value function, a discount factor is involved in the value function. The temporal difference method is introduced to training the Q-function or value function. At last, a detailed procedure is proposed to implement deep reinforcement learning.

Author supplied keywords

Cite

CITATION STYLE

APA

Tan, F., Yan, P., & Guan, X. (2017). Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10637 LNCS, pp. 475–483). Springer Verlag. https://doi.org/10.1007/978-3-319-70093-9_50

Deep Reinforcement Learning: From Q-Learning to Deep Q-Learning

Abstract

Author supplied keywords

Cite

Register to see more suggestions