Login / Signup
Learning to summarize from human feedback.
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul F. Christiano
Published in:
CoRR (2020)
Keyphrases
</>
learning algorithm
learning systems
learning process
prior knowledge
supervised learning
knowledge acquisition
language acquisition
genetic algorithm
computer programming
active learning
learning scheme
human learning