Learning the Reward Function for a Misspecified Model.

Published in: CoRR (2018)

Keyphrases

learning algorithm
prior knowledge
active learning
probabilistic model
statistical model
reinforcement learning
objective function
state space
data mining
machine learning
information extraction
higher order
conditional random fields
decision theoretic
transition model