Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis.
Gen Li
Changxiao Cai
Yuxin Chen
Yuting Wei
Yuejie Chi
Published in:
Oper. Res. (2024)
Keyphrases
complexity analysis
worst case
lower bound
upper bound
theoretical analysis
optimal solution
dynamic programming
multi agent
data sets
cooperative
first order logic
sample size
computational complexity
np hard
state space
sample points
mobile robot
control system
reinforcement learning