WARM: On the Benefits of Weight Averaged Reward Models.
Alexandre Ramé
Nino Vieillard
Léonard Hussenot
Robert Dadashi
Geoffrey Cideron
Olivier Bachem
Johan Ferret
Published in:
CoRR (2024)
Keyphrases
reinforcement learning
probabilistic model
complex systems
experimental data
accurate models
machine learning
database systems
search algorithm
neural network model
bayesian framework
learned models