Nonuniqueness and Convergence to Equivalent Solutions in Observer-based Inverse Reinforcement Learning.
Jared Town
Zachary Morrison
Rushikesh Kamalapurkar
Published in:
CoRR (2022)
Keyphrases
inverse reinforcement learning
partially observable environments
Bayesian nonparametric
special case
preference elicitation
machine learning
genetic algorithm
convergence rate