Login / Signup

Data Mixture Inference: What do BPE Tokenizers Reveal about their Training Data?

Jonathan HayaseAlisa LiuYejin ChoiSewoong OhNoah A. Smith
Published in: CoRR (2024)
Keyphrases