Tree Search-Based Policy Optimization under Stochastic Execution Delay.
David ValensiEsther DermanShie MannorGal DalalPublished in: ICLR (2024)
Keyphrases
- tree search
- stochastic search
- mathematical programming
- branch and bound
- search algorithm
- constraint propagation
- game tree search
- search tree
- tree search algorithm
- monte carlo
- depth first search
- combinatorial optimization
- alpha beta
- optimal policy
- path finding
- iterative deepening
- optimization problems
- neural network
- game tree
- search space
- machine learning