A Partially Observable Monte Carlo Planning Algorithm Based on Path Modification.
Qingya Wang, Feng Liu, Bin Luo
Published in: ACML (2023)
Keyphrases
- Monte Carlo
- importance sampling
- learning algorithm
- search space
- Monte Carlo simulation
- dynamic programming
- partially observable
- optimal solution
- computational complexity
- decision problems
- optimal strategy
- state space
- NP-hard
- matrix inversion
- machine learning
- optimal control
- Monte Carlo tree search
- heuristic search
- expectation maximization
- linear programming
- decision makers
- search algorithm
- objective function
- reinforcement learning