Login / Signup
Models of human preference for learning reward functions.
W. Bradley Knox
Stephane Hatgis-Kessell
Serena Booth
Scott Niekum
Peter Stone
Alessandro Allievi
Published in:
CoRR (2022)
Keyphrases
</>
reinforcement learning
supervised learning
active learning
learning algorithm
search algorithm
prior knowledge
computational models
hidden variables
inverse reinforcement learning
transition probabilities
preference elicitation