With the development of wireless communication technology and the lack of spectrum resources, it is very meaningful to study the dynamic spectrum allocation in the cognitive Internet of Things. In this paper, the system model is firstly established. In an underlay mode, considering the interference between primary and secondary users, jointing channel selection and power allocation, aiming to maximize the spectrum efficiency of all secondary users. Different from the traditional heuristic algorithm, the underlay-cognitive-radio-deep-Q-network frame-work (UCRDQN) based on deep reinforcement learning, is proposed to find the optimal solution efficiently. The simulation results show that the UCRDQN algorithm can achieve higher spectrum efficiency and is more stable and efficient than other algorithms.
CITATION STYLE
Zheng, W., Wu, G., Qie, W., & Zhang, Y. (2019). Deep reinforcement learning for joint channel selection and power allocation in cognitive internet of things. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11956 LNCS, pp. 683–692). Springer. https://doi.org/10.1007/978-3-030-37429-7_69
Mendeley helps you to discover research relevant for your work.