Login / Signup

Exploiting Multi-step Sample Trajectories for Approximate Value Iteration.

Robert William WrightSteven LoscalzoPhilip DexterLei Yu
Published in: ECML/PKDD (1) (2013)
Keyphrases
  • multi step
  • approximate value iteration
  • fixed point
  • temporal difference learning
  • knn
  • sample size
  • k nearest neighbor
  • learning algorithm
  • semi supervised
  • mutual information
  • monte carlo