Login / Signup

Aligning as Debiasing: Causality-Aware Alignment via Reinforcement Learning with Interventional Feedback.

Yu XiaTong YuZhankui HeHandong ZhaoJulian J. McAuleyShuai Li
Published in: NAACL-HLT (2024)
Keyphrases