• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

Offline Reinforcement Learning with Closed-Form Policy Improvement Operators.

Jiachen LiEdwin ZhangMing YinQinxun BaiYu-Xiang WangWilliam Yang Wang
Published in: CoRR (2022)
Keyphrases