Policy Optimization with Stochastic Mirror Descent.
Long Yang, Yu Zhang, Gang Zheng, Qian Zheng, Pengfei Li, Jianhang Huang, Gang Pan
Published in: AAAI (2022)
Keyphrases
- stochastic optimization
- stochastic search
- optimization algorithm
- stochastic programming
- optimization method
- global optimization
- optimization process
- optimization problems
- genetic algorithm
- optimal policy
- state dependent
- direct search
- expected cost
- stochastic model
- optimization model
- constrained optimization
- Monte Carlo
- multistage
- multi-objective
- learning algorithm