RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs.

Afra Feyza Akyürek, Ekin Akyürek, Aman Madaan, Ashwin Kalyan, Peter Clark, Derry Wijaya, Niket Tandon
Published in: CoRR (2023)
Keyphrases