Inference Optimization of Foundation Models on AI Accelerators.
Youngsuk Park
Kailash Budhathoki
Liangfu Chen
Jonas M. Kübler
Jiaji Huang
Matthäus Kleindessner
Jun Huan
Volkan Cevher
Yida Wang
George Karypis
Published in:
CoRR (2024)
Keyphrases
optimization problems
complex systems
general purpose
AI systems
real time
machine learning
artificial intelligence
probabilistic model
parameter estimation
optimization algorithm
data sets
prior knowledge
case based reasoning
optimization process