EPMC: Every Visit Preference Monte Carlo for Reinforcement Learning.
Christian WirthJohannes FürnkranzPublished in: ACML (2013)
Keyphrases
- monte carlo
- reinforcement learning
- temporal difference
- stochastic approximation
- policy evaluation
- monte carlo simulation
- markov chain
- function approximation
- temporal difference learning
- reinforcement learning algorithms
- importance sampling
- monte carlo tree search
- monte carlo methods
- simulation study
- particle filter
- state space
- model free
- markov decision processes
- markovian decision
- adaptive sampling
- optimal control
- variance reduction
- global illumination
- machine learning
- policy iteration
- game tree search
- monte carlo method
- optimal strategy
- matrix inversion
- supervised learning
- learning process
- learning algorithm