Power Allocation in Dual Connectivity Networks Based on Actor-Critic Deep Reinforcement Learning.

Elham Moein Ramin Hasibi Mehdi Rasti Matin Shokri

Published in: WiOpt (2019)

Keyphrases

actor critic
reinforcement learning
temporal difference
policy gradient
power allocation
approximate dynamic programming
optimal control
reinforcement learning algorithms
neuro fuzzy
function approximation
policy iteration
gradient method
mimo systems
model free
markov decision processes
state space
learning algorithm
resource allocation
real time
reinforcement learning methods
dynamical systems
single agent
action selection
rl algorithms
network structure
monte carlo