• search
    search
  • reviewers
    reviewers
  • feeds
    feeds
  • assignments
    assignments
  • settings
  • logout

An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs.

Haihao ShenHengyu MengBo DongZhe WangOfir ZafrirYi DingYu LuoHanwen ChangQun GaoZiheng WangGuy BoudoukhMoshe Wasserblat
Published in: CoRR (2023)
Keyphrases