Constraint-Generation Policy Optimization (CGPO): Nonlinear Programming for Policy Optimization in Mixed Discrete-Continuous MDPs.

Michael Gimelfarb, Ayal Taitler, Scott Sanner
Published in: CoRR (2024)