An Actor-Critic Reinforcement Learning for Device-to-Device Communication Underlaying Cellular Network.

Pratap Khuntia, Ranjay Hazra

Published in: TENCON (2018)

Keyphrases

reinforcement learning
actor critic
cellular networks
data acquisition
function approximation
temporal difference
multi agent
real time
optimal control
reinforcement learning algorithms
policy iteration
state space
transfer learning
model free
decision making
learning algorithm
approximation methods
machine learning