Towards Instance-Optimal Offline Reinforcement Learning with Pessimism.
Ming YinYu-Xiang WangPublished in: CoRR (2021)
Keyphrases
- multi agent
- reinforcement learning
- dynamic programming
- reinforcement learning algorithms
- real time
- multi agent reinforcement learning
- optimal control
- learning algorithm
- optimal design
- transfer learning
- artificial neural networks
- optimal solution
- optimal policy
- state space
- exhaustive search
- temporal difference
- control problems
- decision trees