Relative Loss Bounds for Temporal-Difference Learning.
Jürgen ForsterManfred K. WarmuthPublished in: ICML (2000)
Keyphrases
- loss bounds
- temporal difference learning
- function approximation
- fixed point
- approximate value iteration
- reinforcement learning
- evaluation function
- game playing
- temporal difference
- markov decision process
- expert advice
- monte carlo
- worst case
- reinforcement learning algorithms
- linear regression
- state space
- policy iteration
- probabilistic model
- neural network