Nonuniqueness and Convergence to Equivalent Solutions in Observer-based Inverse Reinforcement Learning.

Jared Town, Zachary Morrison, Rushikesh Kamalapurkar
Published in: CoRR (2022)
Keyphrases
  • inverse reinforcement learning
  • partially observable environments
  • Bayesian nonparametric
  • special case
  • preference elicitation
  • machine learning
  • genetic algorithm
  • convergence rate