Login / Signup

Scaling Laws for Reward Model Overoptimization.

Leo GaoJohn SchulmanJacob Hilton
Published in: CoRR (2022)
Keyphrases
  • probabilistic model
  • high level
  • computational model
  • statistical model
  • data sets
  • cost function
  • theoretical framework
  • neural network
  • mathematical model
  • sensitivity analysis
  • formal model
  • network model