SALMON: Self-Alignment with Principle-Following Reward Models.
Zhiqing Sun
Yikang Shen
Hongxin Zhang
Qinhong Zhou
Zhenfang Chen
David D. Cox
Yiming Yang
Chuang Gan
Published in: CoRR (2023)
Keyphrases
model selection
statistical models
information systems
face recognition
training data
reinforcement learning
parameter estimation
statistical model
experimental data