Sign in

KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal.

Tadashi KozunoWenhao YangNino VieillardToshinori KitamuraYunhao TangJincheng MeiPierre MénardMohammad Gheshlaghi AzarMichal ValkoRémi MunosOlivier PietquinMatthieu GeistCsaba Szepesvári
Published in: CoRR (2022)
Keyphrases