Improving Approximate Value Iteration Using Memories and Predictive State Representations.

Michael R. James, Ton Wessling, Nikos A. Vlassis

Published in: AAAI (2006)

Keyphrases

predictive state representations
approximate value iteration
fixed point
dynamical systems
temporal difference learning
stochastic systems
temporal difference
training data
active learning
evaluation function