Block Policy Mirror Descent.

Guanghui Lan, Yan Li, Tuo Zhao

Published in: CoRR (2022)

Keyphrases

optimal policy
field of view
infinite horizon
DCT coefficients
block size
asymptotically optimal
Markov decision process
state dependent
allocation policy