Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization.
Weiran YaoShelby HeineckeJuan Carlos NieblesZhiwei LiuYihao FengLe XueRithesh MurthyZeyuan ChenJianguo ZhangDevansh ArpitRan XuPhil MuiHuan WangCaiming XiongSilvio SavaresePublished in: CoRR (2023)