Login / Signup

Werewolf Arena: A Case Study in LLM Evaluation via Social Deduction.

Suma BailisJane FriedhoffFeiyang Chen
Published in: CoRR (2024)
Keyphrases
  • case study
  • social media
  • evaluation method
  • data sets
  • neural network
  • social interaction
  • evaluation methods
  • machine learning
  • social networks
  • search algorithm
  • theorem proving
  • user generated