Relative Loss Bounds for Temporal-Difference Learning.

Jürgen Forster Manfred K. Warmuth

Published in: ICML (2000)

Keyphrases

loss bounds
temporal difference learning
function approximation
fixed point
approximate value iteration
reinforcement learning
evaluation function
game playing
temporal difference
markov decision process
expert advice
monte carlo
worst case
reinforcement learning algorithms
linear regression
state space
policy iteration
probabilistic model
neural network