Offline Actor-Critic Reinforcement Learning Scales to Large Models.

Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang, Oliver Groth, Michael Bloesch, Thomas Lampe, Philemon Brakel, Sarah Bechtle, Steven Kapturowski, Roland Hafner, Nicolas Heess, Martin A. Riedmiller
Published in: CoRR (2024)
Keyphrases
  • reinforcement learning
  • actor critic
  • function approximation
  • model free
  • temporal difference
  • gradient method