Login / Signup

Bayesian Reward Models for LLM Alignment.

Adam X. YangMaxime RobeynsThomas CosteJun WangHaitham Bou-AmmarLaurence Aitchison
Published in: CoRR (2024)
Keyphrases
  • statistical models
  • reinforcement learning
  • experimental data
  • data sets
  • genetic algorithm
  • bayesian networks
  • multi agent systems
  • pairwise
  • prior knowledge
  • probabilistic model
  • statistical methods
  • mathematical models