Uncertainty Estimation for Language Reward Models.

Adam Gleave Geoffrey Irving

Published in: CoRR (2022)

Keyphrases

programming language
experimental data
parametric models
probabilistic model
least squares
reinforcement learning
prior knowledge
model selection
parameter estimation
complex systems
statistical models
classification models
model fitting
conceptual models
modelling language