Improving Approximate Value Iteration Using Memories and Predictive State Representations.
Michael R. James
Ton Wessling
Nikos A. Vlassis
Published in:
AAAI (2006)
Keyphrases
predictive state representations
approximate value iteration
fixed point
dynamical systems
temporal difference learning
stochastic systems
temporal difference
training data
active learning
evaluation function