Login / Signup

HeteGen: Heterogeneous Parallel Inference for Large Language Models on Resource-Constrained Devices.

Xuanlei ZhaoBin JiaHaotian ZhouZiming LiuShenggan ChengYang You
Published in: CoRR (2024)
Keyphrases