C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement Learning with Sub-optimal Demonstrations.
Lu Li
Yuxin Pan
Ruobing Chen
Jie Liu
Zilin Wang
Yu Liu
Zhiheng Li
Published in:
CoRR (2023)
Keyphrases
</>
inverse reinforcement learning
partially observable environments
reinforcement learning
learning algorithm
preference elicitation
bayesian nonparametric
decision making
reward function
dynamic programming
supervised learning
optimal control