Learning from Demonstration for Shaping through Inverse Reinforcement Learning.

Halit Bener Suay, Tim Brys, Matthew E. Taylor, Sonia Chernova

Published in: AAMAS (2016)

Keyphrases

inverse reinforcement learning
Bayesian nonparametric
partially observable environments
preference elicitation
reward function
temporal difference
artificial intelligence
fuzzy logic