RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs.
Afra Feyza Akyürek
Ekin Akyürek
Aman Madaan
Ashwin Kalyan
Peter Clark
Derry Wijaya
Niket Tandon
Published in:
CoRR (2023)
Keyphrases
reinforcement learning
natural language
mathematical model
probabilistic model
computational model
statistical model
neural network
dynamic programming
mobile robot
markov decision processes
function approximation
learning algorithm
learning process
optimal policy