Login / Signup
Block Policy Mirror Descent.
Guanghui Lan
Yan Li
Tuo Zhao
Published in:
CoRR (2022)
Keyphrases
</>
optimal policy
field of view
infinite horizon
dct coefficients
block size
asymptotically optimal
markov decision process
state dependent
allocation policy