Model-free iterative learning of time-optimal point-to-point motions for LTI systems.
Pieter Janssens
Goele Pipeleers
Jan Swevers
Published in:
CDC/ECC (2011)
Keyphrases
model free
iterative learning
reinforcement learning
function approximation
temporal difference
average reward
reinforcement learning algorithms
policy iteration