Login / Signup

Inverse Reinforcement Learning by Estimating Expertise of Demonstrators.

Mark BeliaevRamtin Pedarsani
Published in: CoRR (2024)
Keyphrases
  • inverse reinforcement learning
  • bayesian nonparametric
  • partially observable environments
  • preference elicitation
  • reward function
  • temporal difference
  • search algorithm
  • expert systems