Login / Signup
Inverse Reinforcement Learning by Estimating Expertise of Demonstrators.
Mark Beliaev
Ramtin Pedarsani
Published in:
CoRR (2024)
Keyphrases
</>
inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
reward function
temporal difference
search algorithm
expert systems