Nonuniqueness and Convergence to Equivalent Solutions in Observer-based Inverse Reinforcement Learning.

Jared Town, Zachary Morrison, Rushikesh Kamalapurkar
Published in: ACC (2023)
Keyphrases
  • inverse reinforcement learning
  • bayesian nonparametric
  • convergence speed
  • partially observable environments
  • decision making
  • preference elicitation
  • multi agent
  • optimal solution