Latent exploration for Reinforcement Learning.

Alberto Silvio Chiappa Alessandro Marin Vargas Ann Zixiang Huang Alexander Mathis

Published in: NeurIPS (2023)

Keyphrases

reinforcement learning
active exploration
exploration strategy
action selection
model based reinforcement learning
exploration exploitation
autonomous learning
function approximation
exploration exploitation tradeoff
latent variables
markov decision processes
machine learning
optimal policy
temporal difference
state space
learning process
transfer learning
reinforcement learning algorithms
model free
optimal control
latent space
function approximators
temporal difference learning
transition model
probabilistic model
multi agent
objective function