Login / Signup
Localizing Lying in Llama: Understanding Instructed Dishonesty on True-False Questions Through Prompting, Probing, and Patching.
James Campbell
Richard Ren
Phillip Guo
Published in:
CoRR (2023)
Keyphrases
</>
email
learning perl
subject matter
deeper understanding
database
neural network
information retrieval
case study
data model
answer questions
core concepts