Localizing Lying in Llama: Understanding Instructed Dishonesty on True-False Questions Through Prompting, Probing, and Patching.

James Campbell, Richard Ren, Phillip Guo
Published in: CoRR (2023)
Keyphrases
  • email
  • learning perl
  • subject matter
  • deeper understanding
  • database
  • neural network
  • information retrieval
  • case study
  • data model
  • answer questions
  • core concepts