Model-free iterative learning of time-optimal point-to-point motions for LTI systems.

Pieter Janssens, Goele Pipeleers, Jan Swevers
Published in: CDC/ECC (2011)
Keyphrases
  • model free
  • iterative learning
  • reinforcement learning
  • function approximation
  • temporal difference
  • average reward
  • reinforcement learning algorithms
  • policy iteration