Models of human preference for learning reward functions.
W. Bradley Knox
Stephane Hatgis-Kessell
Serena Booth
Scott Niekum
Peter Stone
Alessandro Gabriele Allievi
Published in:
Trans. Mach. Learn. Res. (2024)
Keyphrases
learning algorithm
reinforcement learning
prior knowledge
active learning
computational models
inverse reinforcement learning
search algorithm
probabilistic model
higher order
maximum likelihood
sufficient conditions
generative model
complex systems
conditional random fields