A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning.
Dong-Ki Kim
Miao Liu
Matthew Riemer
Chuangchuang Sun
Marwa Abdulhai
Golnaz Habibi
Sebastian Lopez-Cot
Gerald Tesauro
Jonathan P. How
Published in:
ICML (2021)
Keyphrases
learning algorithm
policy gradient
reinforcement learning
computational complexity
cost function
actor critic
learning process
monte carlo
neural network
optimal solution
search space
learning tasks
function approximation
multi robot
stochastic games
multiagent reinforcement learning