Login / Signup

Regret-Optimal Model-Free Reinforcement Learning for Discounted MDPs with Short Burn-In Time.

Xiang JiGen Li
Published in: CoRR (2023)
Keyphrases