A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning.
Dong-Ki Kim
Miao Liu
Matthew Riemer
Chuangchuang Sun
Marwa Abdulhai
Golnaz Habibi
Sebastian Lopez-Cot
Gerald Tesauro
Jonathan P. How
Published in:
CoRR (2020)
Keyphrases
learning algorithm
policy gradient
actor critic
computational complexity
learning process
cost function
optimal solution
search space
convergence rate
gradient method
cooperative
NP-hard
dynamic programming
learning tasks
multi robot
stochastic games
machine learning