Model-free Learning from Demonstration.

Erik Alexander Billing, Thomas Hellström, Lars-Erik Janlert

Published in: ICAART (2) (2010)

Keyphrases

model-free
reinforcement learning
function approximation
temporal difference
reinforcement learning algorithms
policy iteration
average reward
policy evaluation
machine learning
optical flow
state space
linear programming
RL algorithms