Login / Signup

Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients.

Aashiq MuhamedOscar LiDavid WoodruffMona DiabVirginia Smith
Published in: CoRR (2024)
Keyphrases
  • low memory
  • denoising
  • computationally efficient