Experiments in Parameter Learning Using Temporal Differences.
Jonathan Baxter, Andrew Tridgell, Lex Weaver
Published in: J. Int. Comput. Games Assoc. (1998)
Keyphrases
- parameter learning
- temporal difference
- bayesian networks
- reinforcement learning
- function approximation
- structure learning
- evaluation function
- generative model
- monte carlo
- statistical learning
- conditional random fields
- model-free
- maximum likelihood
- em algorithm
- step size
- action selection
- parameter estimation
- approximate inference
- markov random field
- expectation maximization
- hidden variables
- data sets
- graphical models
- neural network
- machine learning algorithms
- probabilistic inference
- supervised learning
- semi-supervised
- state space
- dynamic programming
- pairwise
- image processing