Login / Signup
Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification.
Joar Max Viktor Skalse
Alessandro Abate
Published in:
ICLR (2024)
Keyphrases
</>
inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
reward function
sensitivity analysis
temporal difference
reinforcement learning
cost function
state space
sufficient conditions