Login / Signup
Agent57: Outperforming the Atari Human Benchmark.
Adrià Puigdomènech Badia
Bilal Piot
Steven Kapturowski
Pablo Sprechmann
Alex Vitvitskyi
Zhaohan Daniel Guo
Charles Blundell
Published in:
CoRR (2020)
Keyphrases
</>
multi agent
multi agent systems
multiagent systems
human users
decision making
autonomous agents
machine learning
least squares
learning algorithm
random walk
fixed point
incomplete information