Login / Signup
On Optimal Caching and Model Multiplexing for Large Model Inference.
Banghua Zhu
Ying Sheng
Lianmin Zheng
Clark W. Barrett
Michael I. Jordan
Jiantao Jiao
Published in:
CoRR (2023)
Keyphrases
</>
computational model
high level
mathematical model
probabilistic model
theoretical analysis
closed form
bayesian model
decision making
bayesian networks
theoretical framework
statistical model
experimental data
network structure
sensitivity analysis
prediction model