Login / Signup
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback.
Harrison Lee
Samrat Phatale
Hassan Mansoor
Kellie Lu
Thomas Mesnard
Colton Bishop
Victor Carbune
Abhinav Rastogi
Published in:
CoRR (2023)
Keyphrases
</>
reinforcement learning
human operators
relevance feedback
learning algorithm
artificial intelligence
motor skills
expert systems
case based reasoning
user engagement
data mining
artificial general intelligence
visual feedback
user feedback
intelligent systems
computer science
decision making
machine learning