Login / Signup
Uncertainty Estimation for Language Reward Models.
Adam Gleave
Geoffrey Irving
Published in:
CoRR (2022)
Keyphrases
</>
programming language
experimental data
parametric models
probabilistic model
least squares
reinforcement learning
prior knowledge
model selection
parameter estimation
complex systems
statistical models
classification models
model fitting
conceptual models
modelling language