Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning.
Mirco Mutti, Riccardo De Santi, Marcello Restelli, Alexander Marx, Giorgia Ramponi
Published in: ICLR (2024)
Keyphrases
- causal graph
- reinforcement learning
- bayesian framework
- causal models
- planning problems
- markov chain monte carlo
- metropolis hastings algorithm
- proposal distribution
- state space
- posterior distribution
- state variables
- optimal policy
- markov decision processes
- particle filter
- monte carlo
- np hardness
- generative model
- plan generation
- machine learning
- gaussian process
- probability distribution