Login / Signup

D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models.

Zhongwei WanXinjian WuYu ZhangYi XinChaofan TaoZhihong ZhuXin WangSiqi LuoJing XiongMi Zhang
Published in: CoRR (2024)
Keyphrases