Learning the Reward Function for a Misspecified Model
Erik Talvitie
Published in: ICML (2018)
Keyphrases
objective function
probabilistic model
learning algorithm
search engine
maximum entropy
decision theoretic
inverse reinforcement learning
prior knowledge
higher order
learning tasks
state variables
decision theory