C
search
search
reviewers
reviewers
feeds
feeds
assignments
assignments
settings
logout
Simultaneously Learning Stochastic and Adversarial Episodic MDPs with Known Transition.
Tiancheng Jin
Haipeng Luo
Published in:
CoRR (2020)
Keyphrases
</>
reinforcement learning
learning process
learning algorithm
online learning
learning problems
learning systems
prior knowledge
supervised learning
knowledge acquisition
active learning
mobile learning
monte carlo
markov decision processes
learning tasks
partially observable
stochastic domains