Login / Signup

Zeroth-Order Optimization Meets Human Feedback: Provable Learning via Ranking Oracles.

Zhiwei TangDmitry RybinTsung-Hui Chang
Published in: CoRR (2023)
Keyphrases