SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning.

Yue Wu, Shrimai Prabhumoye, So Yeon Min, Yonatan Bisk, Ruslan Salakhutdinov, Amos Azaria, Tom M. Mitchell, Yuanzhi Li
Published in: CoRR (2023)
Keyphrases
  • reinforcement learning
  • rl algorithms
  • model free
  • optimal control
  • markov chain
  • multi label
  • dynamical systems