Login / Signup
LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization.
Juntao Zhao
Borui Wan
Yanghua Peng
Haibin Lin
Chuan Wu
Published in:
CoRR (2024)
Keyphrases
</>
adaptive quantization
rate distortion
shape coding
disjoint clusters
low bit rate
data points
clustering algorithm
computer vision
image compression
three dimensional
bit rate
subband coding
similarity measure
subband
video coding