Latent exploration for Reinforcement Learning.
Alberto Silvio ChiappaAlessandro Marin VargasAnn Zixiang HuangAlexander MathisPublished in: NeurIPS (2023)
Keyphrases
- reinforcement learning
- active exploration
- exploration strategy
- action selection
- model based reinforcement learning
- exploration exploitation
- autonomous learning
- function approximation
- exploration exploitation tradeoff
- latent variables
- markov decision processes
- machine learning
- optimal policy
- temporal difference
- state space
- learning process
- transfer learning
- reinforcement learning algorithms
- model free
- optimal control
- latent space
- function approximators
- temporal difference learning
- transition model
- probabilistic model
- multi agent
- objective function