A turbo Q-learning (TQL) for energy efficiency optimization in heterogeneous networks

Xiumin Wang; Lei Li; Jun Li; Zhengquan Li

Journal Article (Open Access)

Entropy (2020) 22(9)

DOI: 10.3390/e22090957

4 Citations

5 Readers

Abstract

To maximize energy efficiency in heterogeneous networks (HetNets), this paper proposes turbo Q-Learning (TQL), which combines a multistage decision process with tabular Q-Learning to optimize the resource configuration. To handle the large action space, the energy efficiency optimization problem is formulated as a multistage decision process: according to the resource allocation of the optimization objectives, the initial problem is divided into several subproblems, each solved by tabular Q-Learning, so that the size of the action space grows linearly rather than exponentially. The initial problem is then solved by iterating the solutions of the subproblems. A simple stability analysis of the algorithm is given. To handle the large state space, a deep neural network (DNN) classifies states, with the optimization policy of the novel Q-Learning used to label the samples. Together, these two steps address the dimensionality of both the action space and the state space. Simulation results show that the approach converges, improves convergence speed by 60% while maintaining almost the same energy efficiency, and retains the capability of system adjustment.
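The decomposition described in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the reward function, the number of levels, the target configuration, and every function name below are assumptions made for illustration, and the DNN state classifier is omitted. The sketch only shows how deciding one resource per stage with tabular Q-Learning, and iterating over the stages, replaces a joint action space of size levels^n with n tables of size levels.

```python
import random

LEVELS = 4            # hypothetical number of discrete settings per resource
TARGET = (2, 0, 3)    # hypothetical best setting, used only by the toy reward


def joint_action_count(levels, n_resources):
    """Joint action space: every combination of per-resource choices."""
    return levels ** n_resources


def staged_action_count(levels, n_resources):
    """Total actions when each resource is decided in its own stage."""
    return levels * n_resources


def toy_reward(config):
    """Assumed stand-in for energy efficiency; largest when config == TARGET."""
    return -sum((c - t) ** 2 for c, t in zip(config, TARGET))


def solve_stage(config, k, rng, steps=2000, alpha=0.5, gamma=0.5):
    """Tabular Q-learning over resource k only; other resources stay fixed.

    State: current level of resource k. Action: the level to switch to.
    """
    q = [[0.0] * LEVELS for _ in range(LEVELS)]
    start = config[k]
    state = start
    for _ in range(steps):
        action = rng.randrange(LEVELS)  # uniform exploration (Q-learning is off-policy)
        trial = config[:k] + (action,) + config[k + 1:]
        reward = toy_reward(trial)
        next_state = action
        # Standard Q-learning update toward reward + discounted best next value.
        td_target = reward + gamma * max(q[next_state])
        q[state][action] += alpha * (td_target - q[state][action])
        state = next_state
    # Greedy choice for this stage, evaluated at the stage's starting state.
    best = max(range(LEVELS), key=lambda a: q[start][a])
    return config[:k] + (best,) + config[k + 1:]


def staged_q_learning(n_sweeps=2, seed=0):
    """Iterate the per-resource subproblems to build the full configuration."""
    rng = random.Random(seed)
    config = (0,) * len(TARGET)
    for _ in range(n_sweeps):
        for k in range(len(TARGET)):
            config = solve_stage(config, k, rng)
    return config
```

With three resources and four levels each, `joint_action_count(4, 3)` is 64 while `staged_action_count(4, 3)` is 12. The toy reward is separable across resources, which is why the stage-by-stage iteration works so easily here; a real energy efficiency objective couples the resources, and the paper's stability analysis would be the place to look for when the iteration still settles.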

Cite

Wang, X., Li, L., Li, J., & Li, Z. (2020). A turbo Q-learning (TQL) for energy efficiency optimization in heterogeneous networks. Entropy, 22(9). https://doi.org/10.3390/e22090957
