Deep Reinforcement Learning-Based RMSA Policy Distillation for Elastic Optical Networks

17Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

Abstract

The reinforcement learning-based routing, modulation, and spectrum assignment has been regarded as an emerging paradigm for resource allocation in the elastic optical networks. One limitation is that the learning process is highly dependent on the training environment, such as the traffic pattern or the optical network topology. Therefore, re-training is required in case of network topology or traffic pattern variations, which consumes a great amount of computation power and time. To ease the requirement of re-training, we propose a policy distillation scheme, which distills knowledge from a well-trained teacher model and then transfers the knowledge to the to-be-trained student model, so that the training of the latter can be accelerated. Specifically, the teacher model is trained for one training environment (e.g., the topology and traffic pattern) and the student model is for another training environment. The simulation results indicate that our proposed method can effectively speed up the training process of the student model, and it even leads to a lower blocking probability, compared with the case that the student model is trained without knowledge distillation.

Cite

CITATION STYLE

APA

Tang, B., Huang, Y. C., Xue, Y., & Zhou, W. (2022). Deep Reinforcement Learning-Based RMSA Policy Distillation for Elastic Optical Networks. Mathematics, 10(18). https://doi.org/10.3390/math10183293

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free