Keyphrases
- artificial neural networks
- theoretic analysis
- temporal difference learning
- function approximation
- neural network
- fixed point
- evaluation function
- game playing
- approximate value iteration
- temporal difference
- reinforcement learning
- semi supervised
- pairwise
- linear combination
- control strategy
- monte carlo
- markov chain
- least squares
- probability distribution
- e learning
- learning algorithm