Login / Signup

Benchmark Data Contamination of Large Language Models: A Survey.

Cheng XuShuhao GuanDerek GreeneM. Tahar Kechadi
Published in: CoRR (2024)
Keyphrases
  • language model
  • knowledge discovery
  • language modeling
  • training data
  • data streams
  • xml documents
  • hidden markov models
  • speech recognition
  • test collection
  • uncertain data