Login / Signup
Infer and Adapt: Bipedal Locomotion Reward Learning from Demonstrations via Inverse Reinforcement Learning.
Feiyang Wu
Zhaoyuan Gu
Hanran Wu
Anqi Wu
Ye Zhao
Published in:
CoRR (2023)
Keyphrases
</>
inverse reinforcement learning
partially observable environments
reinforcement learning
bayesian nonparametric
learning algorithm
preference elicitation
control system
dynamic systems
reward function