Hybrid Temporal-Difference Algorithm Using Sliding Mode Control and Sigmoid Function.

Ke Xu, Fengge Wu
Published in: PRICAI (2016)
Keyphrases
  • optimization algorithm
  • learning algorithm
  • dynamic programming
  • convergence rate
  • neural network
  • multi-objective
  • cost function
  • td learning
  • machine learning
  • model-free
  • temporal difference