Contextual Action with Multiple Policies Inverse Reinforcement Learning for Behavior Simulation.

Nahum ÁlvarezItsuki Noda
Published in: ICAART (2) (2019)
Keyphrases
  • inverse reinforcement learning
  • reward function
  • bayesian nonparametric
  • partially observable environments
  • artificial intelligence
  • reinforcement learning
  • real robot
  • control policies