Optimizing Parameter Learning Using Temporal Differences.
James F. Swafford II
Published in: AAAI/IAAI (2002)
Keyphrases
- parameter learning
- temporal difference
- Bayesian networks
- reinforcement learning
- structure learning
- statistical learning
- function approximation
- generative model
- conditional random fields
- evaluation function
- model free
- maximum likelihood
- Monte Carlo
- step size
- parameter estimation
- Markov random field
- EM algorithm
- approximate inference
- graphical models
- action selection
- hidden variables
- image segmentation
- probabilistic model
- dynamic programming
- least squares
- neural network
- objective function
- expectation maximization
- Markov chain
- training data