Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice.
Toshinori KitamuraTadashi KozunoYunhao TangNino VieillardMichal ValkoWenhao YangJincheng MeiPierre MénardMohammad Gheshlaghi AzarRémi MunosOlivier PietquinMatthieu GeistCsaba SzepesváriWataru KumagaiYutaka MatsuoPublished in: CoRR (2023)
Keyphrases