Calibrated Model-Based Deep Reinforcement Learning.
Ali MalikVolodymyr KuleshovJiaming SongDanny NemerHarlan SeymourStefano ErmonPublished in: ICML (2019)
Keyphrases
- reinforcement learning
- model free
- function approximation
- temporal difference
- markov decision processes
- case study
- optimal policy
- machine learning
- decision making
- multi agent
- reinforcement learning algorithms
- optimal control
- multi agent reinforcement learning
- reinforcement learning methods
- markov decision process
- learning problems
- multi view
- learning process
- website
- computer vision