Login / Signup
Understanding the Effects of RLHF on LLM Generalisation and Diversity.
Robert Kirk
Ishita Mediratta
Christoforos Nalmpantis
Jelena Luketina
Eric Hambro
Edward Grefenstette
Roberta Raileanu
Published in:
ICLR (2024)
Keyphrases
</>
mechanisms underlying
cooperative
real time
machine learning
information retrieval
computer vision
case study
multi objective
probabilistic model
negative effects
theoretical and practical implications