Login / Signup
TurboTransformers: an efficient GPU serving system for transformer models.
Jiarui Fang
Yang Yu
Chengduo Zhao
Jie Zhou
Published in:
PPoPP (2021)
Keyphrases
</>
real time
statistical models
prior knowledge
process model
databases
neural network
artificial intelligence
complex systems
parallel computation
processing units
mathematical models
power system
parameter estimation
model selection
general purpose
artificial neural networks
multiscale
image processing