Gibbs Sampling from Human Feedback: A Provable KL-constrained Framework for RLHF.
Wei Xiong
Hanze Dong
Chenlu Ye
Han Zhong
Nan Jiang
Tong Zhang
Published in:
CoRR (2023)
Keyphrases
gibbs sampling
computer vision
parameter estimation
image reconstruction
active learning
least squares
topic models