Differentially Private Reinforcement Learning with Linear Function Approximation

Xingyu Zhou

Conference ProceedingsOPEN ACCESS

Differentially Private Reinforcement Learning with Linear Function Approximation

Zhou X

SIGMETRICS/PERFORMANCE 2022 - Abstract Proceedings of the 2022 ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems (2022) 77-78

DOI: 10.1145/3489048.3522648

0Citations

9Readers

Get full text

Abstract

Motivated by the wide adoption of reinforcement learning (RL) in real-world personalized services, where users' sensitive and private information needs to be protected, we study regret minimization in finite-horizon Markov decision processes (MDPs) under the constraints of differential privacy (DP). Compared to existing private RL algorithms that work only on tabular finite-state, finite-actions MDPs, we take the first step towards privacy-preserving learning in MDPs with large state and action spaces. Specifically, we consider MDPs with linear function approximation (in particular linear mixture MDPs) under the notion of joint differential privacy (JDP), where the RL agent is responsible for protecting users' sensitive data. We design two private RL algorithms that are based on value iteration and policy optimization, respectively, and show that they enjoy sub-linear regret performance while guaranteeing privacy protection. Moreover, the regret bounds are independent of the number of states, and scale at most logarithmically with the number of actions, making the algorithms suitable for privacy protection in nowadays large-scale personalized services. Our results are achieved via a general procedure for learning in linear mixture MDPs under changing regularizers, which not only generalizes previous results for non-private learning, but also serves as a building block for general private reinforcement learning.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhou, X. (2022). Differentially Private Reinforcement Learning with Linear Function Approximation. In SIGMETRICS/PERFORMANCE 2022 - Abstract Proceedings of the 2022 ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems (pp. 77–78). Association for Computing Machinery, Inc. https://doi.org/10.1145/3489048.3522648

Differentially Private Reinforcement Learning with Linear Function Approximation

Abstract

Author supplied keywords

Cite

Register to see more suggestions