Login / Signup
Metron: Holistic Performance Evaluation Framework for LLM Inference Systems.
Amey Agrawal
Anmol Agarwal
Nitin Kedia
Jayashree Mohan
Souvik Kundu
Nipun Kwatra
Ramachandran Ramjee
Alexey Tumanov
Published in:
CoRR (2024)
Keyphrases
</>
main contribution
inference process
theoretical framework
distributed systems
intelligent systems
computer systems
probabilistic inference
neural network
learning environment
graphical models
belief networks