SALMON: Self-Alignment with Instructable Reward Models.
Zhiqing Sun
Yikang Shen
Hongxin Zhang
Qinhong Zhou
Zhenfang Chen
David Daniel Cox
Yiming Yang
Chuang Gan
Published in: ICLR (2024)
Keyphrases
modeling framework
statistical models
accurate models
data mining
reinforcement learning
probabilistic model
parameter estimation
complex systems
process model
mathematical models