Login / Signup
Inverse Reinforcement Learning from Summary Data.
Antti Kangasrääsiö
Samuel Kaski
Published in:
CoRR (2017)
Keyphrases
</>
inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
reward function
temporal difference
utility function
learning algorithm
decision making
reinforcement learning
cost function
em algorithm
constraint satisfaction problems
partially observable