Policy Gradient Algorithms for Robust MDPs with Non-Rectangular Uncertainty Sets.
Mengmeng Li
Tobias Sutter
Daniel Kuhn
Published in:
CoRR (2023)
Keyphrases
reinforcement learning
learning algorithm
computational complexity
optimization problems
Markov decision processes
gradient ascent
policy search
multi agent
search space
least squares
robust optimization
policy gradient