Login / Signup
Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition.
Tiancheng Jin
Haipeng Luo
Published in:
NeurIPS (2020)
Keyphrases
</>
learning process
learning systems
learning algorithm
decision trees
supervised learning
reinforcement learning
prior knowledge
active learning
knowledge acquisition
unsupervised learning
learning automata