RAT selection for IoT devices in HetNets: Reinforcement learning with hybrid SMDP algorithm.

Hongyi Bian, Qingmiao Zhang, Junhui Zhao, Huan Zhang
Published in: Phys. Commun. (2022)
Keyphrases
  • reinforcement learning
  • dynamic programming
  • learning algorithm
  • search space
  • objective function
  • NP-hard
  • response time
  • Markov decision processes