Login / Signup
Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?
Cheng Zhang
Jianyi Cheng
Ilia Shumailov
George A. Constantinides
Yiren Zhao
Published in:
CoRR (2023)
Keyphrases
</>
inference process
bayesian networks
probabilistic inference
inference engine
motion estimation
motion compensation
neural network
knowledge base
case study
parameter estimation
random fields
bayesian inference
probabilistic reasoning
inference mechanism
shape coding