Login / Signup

Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum.

Hadi PouransariChun-Liang LiJen-Hao Rick ChangPavan Kumar Anasosalu VasuCem KocVaishaal ShankarOncel Tuzel
Published in: CoRR (2024)
Keyphrases