Login / Signup

CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers.

Longwei ZouQingyang WangHan ZhaoJiangang KongYi YangYangdong Deng
Published in: CoRR (2024)
Keyphrases