Nonuniqueness and Convergence to Equivalent Solutions in Observer-based Inverse Reinforcement Learning.
Jared Town
Zachary Morrison
Rushikesh Kamalapurkar
Published in:
ACC (2023)
Keyphrases
inverse reinforcement learning
Bayesian nonparametric
convergence speed
partially observable environments
decision making
preference elicitation
multi-agent
optimal solution