Model-based Lifelong Reinforcement Learning with Bayesian Exploration.
Haotian FuShangqun YuMichael L. LittmanGeorge KonidarisPublished in: NeurIPS (2022)
Keyphrases
- reinforcement learning
- model free
- exploration exploitation
- active exploration
- exploration strategy
- action selection
- model based reinforcement learning
- data driven
- function approximation
- bandit problems
- interval estimation
- bayesian networks
- st century
- active learning
- posterior probability
- autonomous learning
- machine learning
- reinforcement learning algorithms
- bayesian estimation
- maximum likelihood
- state space
- learning algorithm
- bayesian learning
- posterior distribution
- optimal control
- optimal policy
- technology enhanced learning
- bayesian inference
- temporal difference learning
- transfer learning
- dynamic programming
- evolutionary algorithm
- bayesian decision
- learning process
- multi agent
- exploration exploitation tradeoff