Login / Signup
Sublinear Regret for Learning POMDPs.
Yi Xiong
Ningyuan Chen
Xuefeng Gao
Xiang Zhou
Published in:
CoRR (2021)
Keyphrases
</>
reinforcement learning
online learning
learning algorithm
learning process
prior knowledge
supervised learning
learning systems
learning problems
lower bound
active learning
knowledge acquisition
machine learning
e learning
learning styles
markov decision processes
inductive inference