Login / Signup
Integrating reinforcement learning with human demonstrations of varying ability.
Matthew E. Taylor
Halit Bener Suay
Sonia Chernova
Published in:
AAMAS (2011)
Keyphrases
</>
reinforcement learning
human beings
function approximation
cognitive abilities
optimal policy
markov decision processes
reinforcement learning algorithms
human interaction
key features
learning algorithm
database
state space
dynamic programming
human behavior
optimal control
learning process
temporal difference
neural network
data sets