Keyphrases
- temporal difference learning
- quasi newton
- step size
- temporal difference
- function approximation
- fixed point
- game playing
- optimization method
- reinforcement learning
- evaluation function
- optimization methods
- newton method
- reinforcement learning algorithms
- markov decision process
- monte carlo
- optimization algorithm
- multi objective
- convergence rate
- evolutionary algorithm
- function approximators
- pairwise
- learning algorithm