Login / Signup
Policy Distillation and Value Matching in Multiagent Reinforcement Learning.
Samir Wadhwania
Dong-Ki Kim
Shayegan Omidshafiei
Jonathan P. How
Published in:
IROS (2019)
Keyphrases
</>
multiagent reinforcement learning
markov games
joint action
multiagent systems
multi agent
cooperative
stochastic games
markov decision process
optimal policy
learning algorithm
reinforcement learning
markov decision processes
function approximation
infinite horizon