C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Models of human preference for learning reward functions.
W. Bradley Knox
Stephane Hatgis-Kessell
Serena Booth
Scott Niekum
Peter Stone
Alessandro Allievi
Published in:
CoRR (2022)
Keyphrases
</>
reinforcement learning
supervised learning
active learning
learning algorithm
search algorithm
prior knowledge
computational models
hidden variables
inverse reinforcement learning
transition probabilities
preference elicitation