An Actor-Critic Reinforcement Learning for Device-to-Device Communication Underlaying Cellular Network.
Pratap Khuntia, Ranjay Hazra
Published in: TENCON (2018)
Keyphrases
- reinforcement learning
- actor critic
- cellular networks
- data acquisition
- function approximation
- temporal difference
- multi agent
- real time
- optimal control
- reinforcement learning algorithms
- policy iteration
- state space
- transfer learning
- model free
- decision making
- learning algorithm
- approximation methods
- machine learning