Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition.
Tiancheng Jin
Haipeng Luo
Published in:
CoRR (2020)
Keyphrases
reinforcement learning
learning process
learning algorithm
online learning
learning problems
learning systems
prior knowledge
supervised learning
knowledge acquisition
active learning
mobile learning
Monte Carlo
Markov decision processes
learning tasks
partially observable
stochastic domains