Data Selection for Language Models via Importance Resampling.

Sang Michael Xie, Shibani Santurkar, Tengyu Ma, Percy Liang
Published in: CoRR (2023)
Keyphrases
  • language model
  • n gram
  • knowledge discovery
  • document retrieval
  • probabilistic model
  • speech recognition
  • information retrieval
  • training data