Login / Signup

Edge Intelligence Optimization for Large Language Model Inference with Batching and Quantization.

Xinyuan ZhangJiang LiuZehui XiongYudong HuangGaochang XieRan Zhang
Published in: WCNC (2024)
Keyphrases