Inverse Reinforcement Learning with Sub-optimal Experts.

Riccardo Poiani, Gabriele Curti, Alberto Maria Metelli, Marcello Restelli
Published in: CoRR (2024)
Keyphrases
  • inverse reinforcement learning
  • partially observable environments
  • dynamic programming
  • bayesian nonparametric
  • preference elicitation
  • optimal solution
  • worst case
  • markov chain
  • utility function
  • optimal control