Login / Signup
Titanic TD 27.
Bob Hoffman
Published in:
SIGGRAPH Visual Proceedings (1997)
Keyphrases
</>
temporal difference
td learning
temporal difference learning
reinforcement learning
learning algorithm
evaluation function
artificial intelligence
function approximation
reinforcement learning algorithms
real world
decision making
multimedia
training data
lower bound
special case
monte carlo