Login / Signup
A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More.
Zhichao Wang
Bin Bi
Shiva Kumar Pentyala
Kiran Ramnath
Sougata Chaudhuri
Shubham Mehrotra
Zixu Zhu
Xiang-Bo Mao
Sitaram Asur
Na Cheng
Published in:
CoRR (2024)
Keyphrases
</>
image alignment
evolutionary algorithm
machine learning
knowledge base
multi agent
online learning
dynamic time warping
word alignment