Integrating reinforcement learning with human demonstrations of varying ability.
Matthew E. Taylor, Halit Bener Suay, Sonia Chernova
Published in: AAMAS (2011)
Keyphrases
- reinforcement learning
- human beings
- function approximation
- cognitive abilities
- optimal policy
- Markov decision processes
- reinforcement learning algorithms
- human interaction
- key features
- learning algorithm
- database
- state space
- dynamic programming
- human behavior
- optimal control
- learning process
- temporal difference
- neural network
- data sets