Is Q-Learning Minimax Optimal? A Tight Sample Complexity Analysis.

Gen Li Changxiao Cai Yuxin Chen Yuting Wei Yuejie Chi

Published in: Oper. Res. (2024)

Keyphrases

complexity analysis
worst case
lower bound
upper bound
theoretical analysis
optimal solution
dynamic programming
multi agent
data sets
cooperative
first order logic
sample size
computational complexity
np hard
state space
sample points
mobile robot
control system
reinforcement learning