Sign in

nuQmm: Quantized MatMul for Efficient Inference of Large-Scale Generative Language Models.

Gunho ParkBaeseong ParkSe Jung KwonByeongwook KimYoungjoo LeeDongsoo Lee
Published in: CoRR (2022)
Keyphrases