A Privacy-Preserving Reinforcement Learning Approach for Dynamic Treatment Regimes on Health Data

2Citations
Citations of this article
8Readers
Mendeley users who have this article in their library.

Abstract

Based on the clinical states of the patient, dynamic treatment regime technology can provide various therapeutic methods, which is helpful for medical treatment policymaking. Reinforcement learning is an important approach for developing this technology. In order to implement the reinforcement learning algorithm efficiently, the computation of health data is usually outsourced to the untrustworthy cloud server. However, it may leak, falsify, or delete private health data. Encryption is a common method for solving this problem. But the cloud server is difficult to calculate encrypted health data. In this paper, based on Cheon et al.'s approximate homomorphic encryption scheme, we first propose secure computation protocols for implementing comparison, maximum, exponentiation, and division. Next, we design a homomorphic reciprocal of square root protocol firstly, which only needs one approximate computation. Based on the proposed secure computation protocols, we design a secure asynchronous advantage actor-critic reinforcement learning algorithm for the first time. Then, it is used to implement a secure treatment decision-making algorithm. Simulation results show that our secure computation protocols and algorithms are feasible.

Cite

CITATION STYLE

APA

Sun, X., Sun, Z., Wang, T., Feng, J., Wei, J., & Hu, G. (2021). A Privacy-Preserving Reinforcement Learning Approach for Dynamic Treatment Regimes on Health Data. Wireless Communications and Mobile Computing, 2021. https://doi.org/10.1155/2021/8952219

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free