Orchestrated Value Mapping for Reinforcement Learning.

Mehdi Fatemi Arash Tavakoli

Published in: ICLR (2022)

Keyphrases

reinforcement learning
function approximation
state space
model free
decision making
multi agent
learning process
robotic control
temporal difference learning
reinforcement learning algorithms
direct policy search
database
multi agent reinforcement learning
reinforcement learning methods
function approximators
learning problems
dynamic programming
expert systems
website
artificial intelligence
data sets