Login / Signup
Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning.
Harsh Gupta
R. Srikant
Lei Ying
Published in:
NeurIPS (2019)
Keyphrases
</>
reinforcement learning
adaptive learning rate
learning rate
lower bound
state space
upper bound
neural network
learning algorithm
machine learning
finite number
globally convergent
learning problems