Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification.

Joar Max Viktor Skalse Alessandro Abate

Published in: ICLR (2024)

Keyphrases

inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
reward function
sensitivity analysis
temporal difference
reinforcement learning
cost function
state space
sufficient conditions