Login / Signup
Model-free Learning from Demonstration.
Erik Alexander Billing
Thomas Hellström
Lars-Erik Janlert
Published in:
ICAART (2) (2010)
Keyphrases
</>
model free
reinforcement learning
function approximation
temporal difference
reinforcement learning algorithms
policy iteration
average reward
policy evaluation
machine learning
optical flow
state space
linear programming
rl algorithms