ITIF: Integrated Transformers Inference Framework for Multiple Tenants on GPU.

Yuning Zhang, Zao Zhang, Wei Bao, Dong Yuan
Published in: ICPP (2023)
Keyphrases
  • database
  • parallel processing
  • inference process
  • integrating multiple
  • real time
  • bayesian networks
  • general purpose