Login / Signup
Regularized Inverse Reinforcement Learning.
Wonseok Jeon
Chen-Yang Su
Paul Barde
Thang Doan
Derek Nowrouzezahrai
Joelle Pineau
Published in:
CoRR (2020)
Keyphrases
</>
inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
reward function
least squares
objective function
machine learning
reinforcement learning
temporal difference
artificial intelligence
multi agent
markov decision process