Login / Signup

Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection.

Kyungjae LeeDasol HwangSunghyun ParkYoungsoo JangMoontae Lee
Published in: CoRR (2024)
Keyphrases