Login / Signup
An unified approach to inverse reinforcement learning by oppositive demonstrations.
Kao-Shing Hwang
Wei-Cheng Jiang
Yi-Chia Tseng
Published in:
ICIT (2016)
Keyphrases
</>
inverse reinforcement learning
bayesian nonparametric
partially observable environments
preference elicitation
reward function
temporal difference
search algorithm
special case
decision making