Login / Signup
SALMON: Self-Alignment with Principle-Following Reward Models.
Zhiqing Sun
Yikang Shen
Hongxin Zhang
Qinhong Zhou
Zhenfang Chen
David D. Cox
Yiming Yang
Chuang Gan
Published in:
CoRR (2023)
Keyphrases
</>
model selection
statistical models
information systems
face recognition
training data
reinforcement learning
parameter estimation
statistical model
experimental data