Sample Complexity of Neural Policy Mirror Descent for Policy Optimization on Low-Dimensional Manifolds.
Zhenghao Xu
Xiang Ji
Minshuo Chen
Mengdi Wang
Tuo Zhao
Published in: CoRR (2023)
Keyphrases
sample complexity
optimal policy
theoretical analysis
learning algorithm
upper bound
machine learning
data analysis
lower bound
probabilistic model
state space
nonlinear dimensionality reduction