The Power of Forgetting: Improving the Last-Good-Reply Policy in Monte Carlo Go.
Hendrik BaierPeter D. DrakePublished in: IEEE Trans. Comput. Intell. AI Games (2010)
Keyphrases
- monte carlo
- markov chain
- policy evaluation
- importance sampling
- monte carlo methods
- monte carlo simulation
- optimal strategy
- monte carlo tree search
- matrix inversion
- point processes
- temporal difference
- optimal policy
- adaptive sampling
- monte carlo method
- simulation study
- markovian decision
- particle filter
- markov chain monte carlo
- stochastic approximation
- game tree search
- variance reduction
- markov decision process