Login / Signup
Dynamic handoff policy for RAN slicing by exploiting deep reinforcement learning.
Yuansheng Wu
Guanqun Zhao
Dadong Ni
Junyi Du
Published in:
EURASIP J. Wirel. Commun. Netw. (2021)
Keyphrases
</>
reinforcement learning
optimal policy
dynamic programming
function approximation
policy search
machine learning
state space
dynamic environments
partially observable environments
learning algorithm
wireless networks
mobile devices
response time
markov decision processes
action selection
markov decision problems