Login / Signup
On the Performance of Maximum Likelihood Inverse Reinforcement Learning
Héctor Ratia
Luis Montesano
Ruben Martinez-Cantin
Published in:
CoRR (2012)
Keyphrases
</>
inverse reinforcement learning
maximum likelihood
bayesian nonparametric
partially observable environments
preference elicitation
mixture model
reward function
em algorithm
expectation maximization
maximum a posteriori
hyperparameters
temporal difference
dynamic programming
gaussian distribution