Gibbs Sampling from Human Feedback: A Provable KL-constrained Framework for RLHF.
Wei Xiong
Hanze Dong
Chenlu Ye
Han Zhong
Nan Jiang
Tong Zhang
Published in:
CoRR (2023)
Keyphrases
gibbs sampling
computer vision
parameter estimation
image reconstruction
active learning
least squares
topic models