Login / Signup
Learning Reward Models for Cooperative Trajectory Planning with Inverse Reinforcement Learning and Monte Carlo Tree Search.
Karl Kurzer
Matthias Bitzer
J. Marius Zöllner
Published in:
CoRR (2022)
Keyphrases
</>
inverse reinforcement learning
partially observable environments
reinforcement learning
learning algorithm
trajectory planning
preference elicitation
monte carlo tree search
neural network
monte carlo
probabilistic model
learning tasks
long run
reward function