Variational Bayesian Reinforcement Learning with Regret Bounds.
Brendan O'DonoghuePublished in: NeurIPS (2021)
Keyphrases
- variational bayesian
- reinforcement learning
- regret bounds
- matrix factorization
- expectation maximization
- exponential family
- incomplete data
- online learning
- learning algorithm
- density function
- state space
- collaborative filtering
- maximum likelihood
- learning problems
- mean shift
- image segmentation
- lower bound
- missing values
- transfer learning
- semi supervised
- mixture model
- em algorithm
- text mining
- principal component analysis
- supervised learning