Login / Signup
Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning.
Harsh Gupta
R. Srikant
Lei Ying
Published in:
CoRR (2019)
Keyphrases
</>
reinforcement learning
adaptive learning rate
learning rate
learning algorithm
lower bound
neural network
upper bound
machine learning
state space
worst case
finite number
optimization problems
game theory