Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning.

Harsh Gupta R. Srikant Lei Ying

Published in: CoRR (2019)

Keyphrases

reinforcement learning
adaptive learning rate
learning rate
learning algorithm
lower bound
neural network
upper bound
machine learning
state space
worst case
finite number
optimization problems
game theory