Regurgitative Training: The Value of Real Data in Training Large Language Models.

Jinghui Zhang, Dandan Qiao, Mochen Yang, Qiang Wei
Published in: CoRR (2024)
Keyphrases
  • language model
  • language modeling
  • training set
  • probabilistic model
  • speech recognition
  • error rate
  • n-gram
  • document retrieval
  • context sensitive
  • statistical language models