Login / Signup
SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning.
Yue Wu
Shrimai Prabhumoye
So Yeon Min
Yonatan Bisk
Ruslan Salakhutdinov
Amos Azaria
Tom M. Mitchell
Yuanzhi Li
Published in:
CoRR (2023)
Keyphrases
</>
reinforcement learning
rl algorithms
model free
optimal control
markov chain
multi label
dynamical systems