Examining the Use of Temporal-Difference Incremental Delta-Bar-Delta for Real-World Predictive Knowledge Architectures.
Johannes Günther
Nadia M. Ady
Alex Kearney
Michael Rory Dawson
Patrick M. Pilarski
Published in:
Frontiers in Robotics and AI (2020)
Keyphrases
temporal difference
delta bar delta
td learning
evaluation function
reinforcement learning
data sets
function approximation
data mining
learning algorithm
predictive state representations
monte carlo
multi objective
state space
multi class
step size
action selection
perceptron algorithm