Login / Signup

System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models.

Sam Ade JacobsMasahiro TanakaChengming ZhangMinjia ZhangReza Yazdani AminabadiShuaiwen Leon SongSamyam RajbhandariYuxiong He
Published in: IPDPS (Workshops) (2024)
Keyphrases
  • machine learning
  • fuzzy logic
  • accurate models
  • data sets
  • prior knowledge
  • active learning
  • probabilistic model
  • probability distribution
  • power system
  • experimental data
  • bayesian framework
  • classification models