Login / Signup
NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing.
Guseul Heo
Sangyeop Lee
Jaehong Cho
Hyunmin Choi
Sanghyeon Lee
Hyungkyu Ham
Gwangsun Kim
Divya Mahajan
Jongse Park
Published in:
ASPLOS (3) (2024)
Keyphrases
</>
knowledge representation
heterogeneous networks
data sets
real world
database
machine learning
genetic algorithm
inference engine
semantically rich