An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs.

Published in: CoRR (2023)

Keyphrases