Convergence Guarantees for Deep Epsilon Greedy Policy Learning.

Michael Rawson Radu Balan

Published in: CoRR (2021)

Keyphrases

online learning
learning process
learning algorithm
reinforcement learning
supervised learning
unsupervised learning
learning tasks
neural network
machine learning
prior knowledge
knowledge acquisition
learning problems
dynamic programming
incremental learning
action selection
elementary school