Login / Signup

Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game.

Wei XiongHan ZhongChengshuai ShiCong ShenLiwei WangTong Zhang
Published in: CoRR (2022)
Keyphrases