NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing.
Guseul Heo
Sangyeop Lee
Jaehong Cho
Hyunmin Choi
Sanghyeon Lee
Hyungkyu Ham
Gwangsun Kim
Divya Mahajan
Jongse Park
Published in:
CoRR (2024)
Keyphrases
knowledge representation
objective function
heterogeneous data
heterogeneous networks
data sets
three dimensional