Reward learning from human preferences and demonstrations in Atari.

Borja Ibarz Jan Leike Tobias Pohlen Geoffrey Irving Shane Legg Dario Amodei

Published in: NeurIPS (2018)

Keyphrases

reinforcement learning
active learning
learning algorithm
neural network
decision making
learning process
supervised learning
cooperative
multi agent systems
learning tasks