Optimal Resource Allocation in Wireless Control Systems via Deep Policy Gradient.
Vinicius Lima SilvaMark EisenKonstantinos GatsisAlejandro RibeiroPublished in: CoRR (2019)
Keyphrases
- policy gradient
- optimal resource allocation
- control system
- resource allocation
- reinforcement learning
- actor critic
- parametric optimization
- function approximation
- optimal control
- approximation methods
- reinforcement learning algorithms
- average reward
- gradient method
- model free reinforcement learning
- control strategies
- variance reduction
- reinforcement learning methods
- partially observable markov decision processes
- control law
- control strategy
- neural network