Splitwise: Efficient Generative LLM Inference Using Phase Splitting.
Pratyush Patel
Esha Choukse
Chaojie Zhang
Aashaka Shah
Íñigo Goiri
Saeed Maleki
Ricardo Bianchini
Published in: ISCA (2024)
Keyphrases
artificial intelligence
efficient learning
database
data mining
computationally efficient
highly efficient
neural network
information retrieval
information systems
generative model