How do Large Language Models Navigate Conflicts between Honesty and Helpfulness?

Ryan Liu, Theodore R. Sumers, Ishita Dasgupta, Thomas L. Griffiths
Published in: CoRR (2024)
Keyphrases