Login / Signup
Scaling Laws for Reward Model Overoptimization.
Leo Gao
John Schulman
Jacob Hilton
Published in:
CoRR (2022)
Keyphrases
</>
probabilistic model
high level
computational model
statistical model
data sets
cost function
theoretical framework
neural network
mathematical model
sensitivity analysis
formal model
network model