DeepSpeed- Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale.
Reza Yazdani AminabadiSamyam RajbhandariAmmar Ahmad AwanCheng LiDu LiElton ZhengOlatunji RuwaseShaden SmithMinjia ZhangJeff RasleyYuxiong HePublished in: SC (2022)
Keyphrases
- efficient inference
- probabilistic inference
- structured prediction
- fully connected
- exact inference
- human pose estimation
- hidden variables
- conditional random fields
- factor graphs
- bayesian networks
- approximate inference
- probabilistic model
- pairwise
- linear models
- dynamic bayesian networks
- parameter estimation
- random fields
- bayesian inference
- generative model
- model selection
- markov networks
- graphical models
- higher order
- prior knowledge
- learning algorithm