Learning from Demonstration for Shaping through Inverse Reinforcement Learning.
Halit Bener Suay
Tim Brys
Matthew E. Taylor
Sonia Chernova
Published in:
AAMAS (2016)
Keyphrases
inverse reinforcement learning
Bayesian nonparametric
partially observable environments
preference elicitation
reward function
temporal difference
artificial intelligence
fuzzy logic