CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning.
Sheng Yue
Guanbo Wang
Wei Shao
Zhaofeng Zhang
Sen Lin
Ju Ren
Junshan Zhang
Published in:
ICLR (2023)
Keyphrases
inverse reinforcement learning
partially observable environments
reinforcement learning
bayesian nonparametric
learning tasks
reward function
preference elicitation
learning algorithm
artificial intelligence
prior knowledge
supervised learning
model free
partially observable