A soft actor-critic deep reinforcement learning method for multi-timescale coordinated operation of microgrids

Chunchao Hu; Zexiang Cai; Yanxu Zhang; Rudai Yan; Yu Cai; Bowei Cen

Journal ArticleOPEN ACCESS

A soft actor-critic deep reinforcement learning method for multi-timescale coordinated operation of microgrids

Protection and Control of Modern Power Systems (2022) 7(1)

DOI: 10.1186/s41601-022-00252-z

32Citations

17Readers

Abstract

This paper develops a multi-timescale coordinated operation method for microgrids based on modern deep reinforcement learning. Considering the complementary characteristics of different storage devices, the proposed approach achieves multi-timescale coordination of battery and supercapacitor by introducing a hierarchical two-stage dispatch model. The first stage makes an initial decision irrespective of the uncertainties using the hourly predicted data to minimize the operational cost. For the second stage, it aims to generate corrective actions for the first-stage decisions to compensate for real-time renewable generation fluctuations. The first stage is formulated as a non-convex deterministic optimization problem, while the second stage is modeled as a Markov decision process solved by an entropy-regularized deep reinforcement learning method, i.e., the Soft Actor-Critic. The Soft Actor-Critic method can efficiently address the exploration–exploitation dilemma and suppress variations. This improves the robustness of decisions. Simulation results demonstrate that different types of energy storage devices can be used at two stages to achieve the multi-timescale coordinated operation. This proves the effectiveness of the proposed method.

Author supplied keywords

Cite

CITATION STYLE

APA

Hu, C., Cai, Z., Zhang, Y., Yan, R., Cai, Y., & Cen, B. (2022). A soft actor-critic deep reinforcement learning method for multi-timescale coordinated operation of microgrids. Protection and Control of Modern Power Systems, 7(1). https://doi.org/10.1186/s41601-022-00252-z

A soft actor-critic deep reinforcement learning method for multi-timescale coordinated operation of microgrids

Abstract

Author supplied keywords

Cite

Register to see more suggestions