Learning Optimal Q-Function Using Deep Boltzmann Machine for Reliable Trading of Cryptocurrency

Seok Jun Bu; Sung Bae Cho

Conference Proceedings

Learning Optimal Q-Function Using Deep Boltzmann Machine for Reliable Trading of Cryptocurrency

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11314 LNCS 468-480

DOI: 10.1007/978-3-030-03493-1_49

10Citations

27Readers

Get full text

Abstract

The explosive price volatility from the end of 2017 to January 2018 shows that bitcoin is a high risk asset. The deep reinforcement algorithm is straightforward idea for directly outputs the market management actions to achieve higher profit instead of higher price-prediction accuracy. However, existing deep reinforcement learning algorithms including Q-learning are also limited to problems caused by enormous searching space. We propose a combination of double Q-network and unsupervised pre-training using Deep Boltzmann Machine (DBM) to generate and enhance the optimal Q-function in cryptocurrency trading. We obtained the profit of 2,686% in simulation, whereas the best conventional model had that of 2,087% for the same period of test. In addition, our model records 24% of profit while market price significantly drops by −64%.

Author supplied keywords

Cite

CITATION STYLE

APA

Bu, S. J., & Cho, S. B. (2018). Learning Optimal Q-Function Using Deep Boltzmann Machine for Reliable Trading of Cryptocurrency. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11314 LNCS, pp. 468–480). Springer Verlag. https://doi.org/10.1007/978-3-030-03493-1_49

Learning Optimal Q-Function Using Deep Boltzmann Machine for Reliable Trading of Cryptocurrency

Abstract

Author supplied keywords

Cite

Register to see more suggestions