Abstract
The application of reinforcement learning to the optimal control of building systems has gained traction in recent years as it can cut the building energy consumption and improve human comfort. Despite using sample-efficient reinforcement learning algorithms, most related work requires several months of sensor data and operational parameters of the building to train an agent that outperforms existing rule-based controllers in a large multi-zone building. Moreover, exploring the large state and action spaces can result in poor indoor environmental quality for occupants. In this paper, we propose to reduce the training cost of a policy gradient reinforcement learning algorithm by learning a library of control policies on a training building and taking advantage of both environmental and policy diversity. To transfer these policies to a target building, which can be different from the training building, we develop a simple method to assign the best pretrained policy in the library to each zone of the target building. We show that even without retraining the transferred policies on the target building, they can reduce the HVAC energy consumption by 40.4% compared to a fixed-schedule baseline and by 48.97% compared to agents trained on the target building for 5,000 months. The plausibility of our results underscores the importance of using diversity and transfer learning in multi-Agent reinforcement learning settings and could pave the way for the adoption of reinforcement-learning based controllers in real buildings.
Author supplied keywords
Cite
CITATION STYLE
Zhang, T., Aakash Krishna, G. S., Afshari, M., Musilek, P., Taylor, M. E., & Ardakanian, O. (2022). Diversity for transfer in learning-based control of buildings. In e-Energy 2022 - Proceedings of the 2022 13th ACM International Conference on Future Energy Systems (pp. 556–564). Association for Computing Machinery, Inc. https://doi.org/10.1145/3538637.3539615
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.