Multi-Level Progressive Reinforcement Learning for Control Policy in Physical Simulations.
Kefei Wu, Xuming He, Yang Wang, Xiaopei Liu
Published in: ICRA (2024)
Keyphrases
- control policy
- reinforcement learning
- approximate dynamic programming
- admission control
- control policies
- long run
- batch mode
- function approximation
- state space
- Markov decision processes
- temporal difference
- learning algorithm
- simulation model
- model free
- control strategies
- optimal policy
- machine learning
- multi layer
- numerical simulations
- dynamic programming
- multi agent
- physical models