Lightweight Monte Carlo Algorithm for Markov Decision Processes.
Axel Legay, Sean Sedwards
Published in: CoRR (2013)
Keyphrases
- Monte Carlo
- lightweight
- Markov decision processes
- dynamic programming
- Monte Carlo simulation
- learning algorithm
- model-based reinforcement learning
- Markov chain
- computational complexity
- optimal solution
- importance sampling
- policy evaluation
- finite state
- policy iteration
- optimal policy
- search space
- Monte Carlo tree search
- average reward
- objective function
- probability distribution
- state abstraction
- game tree
- NP-hard
- state space