Login / Signup

An unified approach to inverse reinforcement learning by oppositive demonstrations.

Kao-Shing HwangWei-Cheng JiangYi-Chia Tseng
Published in: ICIT (2016)
Keyphrases
  • inverse reinforcement learning
  • bayesian nonparametric
  • partially observable environments
  • preference elicitation
  • reward function
  • temporal difference
  • search algorithm
  • special case
  • decision making