System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models.

Sam Ade Jacobs, Masahiro Tanaka, Chengming Zhang, Minjia Zhang, Reza Yazdani Aminadabi, Shuaiwen Leon Song, Samyam Rajbhandari, Yuxiong He
Published in: PODC (2024)
Keyphrases