Login / Signup
Regularized Inverse Reinforcement Learning.
Wonseok Jeon
Chen-Yang Su
Paul Barde
Thang Doan
Derek Nowrouzezahrai
Joelle Pineau
Published in:
ICLR (2021)
Keyphrases
</>
inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
reward function
least squares
temporal difference
partially observable
machine learning
decision making
objective function
privacy preserving