Login / Signup
Inverse Reinforcement Learning with Simultaneous Estimation of Rewards and Dynamics.
Michael Herman
Tobias Gindele
Jörg Wagner
Felix Schmitt
Wolfram Burgard
Published in:
CoRR (2016)
Keyphrases
</>
inverse reinforcement learning
reward function
bayesian nonparametric
reinforcement learning
partially observable environments
markov decision processes
reinforcement learning algorithms
state space
dynamical systems
multiple agents
transition probabilities
temporal difference
simple examples