Tree Search-Based Policy Optimization under Stochastic Execution Delay.
David Valensi, Esther Derman, Shie Mannor, Gal Dalal
Published in: CoRR (2024)
Keyphrases
- tree search
- stochastic search
- mathematical programming
- branch and bound
- search algorithm
- constraint propagation
- game tree search
- alpha beta
- iterative deepening
- depth first search
- tree search algorithm
- optimization problems
- monte carlo
- search tree
- game tree
- combinatorial optimization
- search space
- neural network
- optimal policy
- multi dimensional
- state space
- reinforcement learning