Convergence Guarantees for Deep Epsilon Greedy Policy Learning.
Michael Rawson
Radu Balan
Published in:
CoRR (2021)
Keyphrases
online learning
learning process
learning algorithm
reinforcement learning
supervised learning
unsupervised learning
learning tasks
neural network
machine learning
prior knowledge
knowledge acquisition
learning problems
dynamic programming
incremental learning
action selection
elementary school