Refined Analysis of FPL for Adversarial Markov Decision Processes.
Yuanhao Wang
Kefan Dong
Published in:
CoRR (2020)
Keyphrases
markov decision processes
optimal policy
state space
reinforcement learning
dynamic programming
policy iteration
transition matrices
data mining
learning algorithm
np hard
infinite horizon
planning under uncertainty
action sets