Login / Signup
Provably Sample Efficient RLHF via Active Preference Optimization.
Nirjhar Das
Souradip Chakraborty
Aldo Pacchiano
Sayak Ray Chowdhury
Published in:
CoRR (2024)
Keyphrases
</>
computationally efficient
optimization algorithm
cost effective
multi objective
sample size
efficient optimization