Orchestrated Value Mapping for Reinforcement Learning.
Mehdi FatemiArash TavakoliPublished in: ICLR (2022)
Keyphrases
- reinforcement learning
- function approximation
- state space
- model free
- decision making
- multi agent
- learning process
- robotic control
- temporal difference learning
- reinforcement learning algorithms
- direct policy search
- database
- multi agent reinforcement learning
- reinforcement learning methods
- function approximators
- learning problems
- dynamic programming
- expert systems
- website
- artificial intelligence
- data sets