Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda.
Carlton DowneyScott SannerPublished in: ICML (2010)
Keyphrases
- temporal difference
- bayesian model averaging
- model selection
- model averaging
- posterior distribution
- td learning
- reinforcement learning
- evaluation function
- feature selection
- function approximation
- variable selection
- monte carlo
- step size
- model free
- bayesian networks
- cross validation
- action selection
- latent variables
- probability distribution
- supervised learning
- bayesian inference
- machine learning
- parameter estimation
- markov chain monte carlo
- gaussian processes
- cost function
- bayesian framework
- active learning
- maximum a posteriori
- feature extraction