Batch size-invariance for policy optimization.
Jacob Hilton
Karl Cobbe
John Schulman
Published in:
NeurIPS (2022)
Keyphrases
batch size
optimal policy
optimization problems
asymptotically optimal
finite horizon
multi-class
Poisson process
order quantity
reinforcement learning
special case
NP-hard
multistage
infinite horizon
Markov decision process
batch mode
batch processing