Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback.
Canzhe Zhao
Ruofeng Yang
Baoxiang Wang
Xuezhou Zhang
Shuai Li
Published in:
CoRR (2023)
Keyphrases
Markov decision processes
reinforcement learning
low rank
learning process
learning algorithm
partially observable
learning tasks
search space
state space
supervised learning
learning problems
training data
least squares
high order
singular value decomposition