Login / Signup
RAT selection for IoT devices in HetNets: Reinforcement learning with hybrid SMDP algorithm.
Hongyi Bian
Qingmiao Zhang
Junhui Zhao
Huan Zhang
Published in:
Phys. Commun. (2022)
Keyphrases
</>
reinforcement learning
dynamic programming
learning algorithm
search space
objective function
np hard
response time
markov decision processes