Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization.
Weiran YaoShelby HeineckeJuan Carlos NieblesZhiwei LiuYihao FengLe XueRithesh R. N.Zeyuan ChenJianguo ZhangDevansh ArpitRan XuPhil MuiHuan WangCaiming XiongSilvio SavaresePublished in: ICLR (2024)