Login / Signup
A Family of \(\boldsymbol{s}\)-Rectangular Robust MDPs: Relative Conservativeness, Asymptotic Analyses, and Finite-Sample Properties.
Sivaramakrishnan Ramani
Archis Ghate
Published in:
SIAM J. Optim. (2024)
Keyphrases
</>
finite sample
sample size
statistical learning theory
uniform convergence
error bounds
nearest neighbor
parzen window
reinforcement learning
knn
generalization error
training data
least squares
theoretical analysis
supervised classification
generalization bounds