Login / Signup
Learning the Reward Function for a Misspecified Model.
Erik Talvitie
Published in:
CoRR (2018)
Keyphrases
</>
learning algorithm
prior knowledge
active learning
probabilistic model
statistical model
reinforcement learning
objective function
state space
data mining
machine learning
information extraction
higher order
conditional random fields
decision theoretic
transition model