Inverse Reinforcement Learning by Estimating Expertise of Demonstrators.

Mark Beliaev Ramtin Pedarsani

Published in: CoRR (2024)

Keyphrases

inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
reward function
temporal difference
search algorithm
expert systems