Login / Signup
Reward learning from human preferences and demonstrations in Atari.
Borja Ibarz
Jan Leike
Tobias Pohlen
Geoffrey Irving
Shane Legg
Dario Amodei
Published in:
NeurIPS (2018)
Keyphrases
</>
reinforcement learning
active learning
learning algorithm
neural network
decision making
learning process
supervised learning
cooperative
multi agent systems
learning tasks