Thorough Characterization and Analysis of Large Transformer Model Training At-Scale.
Scott Cheng, Jun-Liang Lin, Murali Emani, Siddhisanket Raskar, Sam Foreman, Zhen Xie, Venkatram Vishwanath, Mahmut T. Kandemir
Published in: SIGMETRICS/Performance (Abstracts) (2024)
Keyphrases
- computational model
- mathematical model
- experimental data
- machine learning
- data sets
- statistical model
- theoretical framework
- probabilistic model
- theoretical analysis
- linear model
- process model
- parameter estimation
- markov chain
- least squares
- evolutionary algorithm
- expert systems
- objective function
- reinforcement learning
- high level
- image segmentation