Login / Signup

Understanding the Effects of RLHF on LLM Generalisation and Diversity.

Robert KirkIshita MedirattaChristoforos NalmpantisJelena LuketinaEric HambroEdward GrefenstetteRoberta Raileanu
Published in: CoRR (2023)
Keyphrases
  • mechanisms underlying
  • social networks
  • decision making
  • metadata
  • three dimensional
  • wide range
  • video sequences
  • cooperative
  • hidden markov models
  • particle swarm optimization