Edge Intelligence Optimization for Large Language Model Inference with Batching and Quantization.

Xinyuan Zhang, Jiang Liu, Zehui Xiong, Yudong Huang, Gaochang Xie, Ran Zhang
Published in: CoRR (2024)