Keyphrases
- monte carlo
- policy search
- monte carlo methods
- reinforcement learning
- reinforcement learning algorithms
- continuous state
- temporal difference
- markov chain
- importance sampling
- dynamic programming
- variance reduction
- policy gradient
- partially observable markov decision processes
- state space
- reward function
- finite state
- particle filter
- search space
- monte carlo tree search
- function approximators
- monte carlo method
- markov decision problems
- machine learning
- markov chain monte carlo
- optimal strategy
- reinforcement learning methods