MECLA: Memory-Compute-Efficient LLM Accelerator with Scaling Sub-matrix Partition.
Yubin Qin
Yang Wang
Zhiren Zhao
Xiaolong Yang
Yang Zhou
Shaojun Wei
Yang Hu
Shouyi Yin
Published in:
ISCA (2024)
Keyphrases
limited memory
compute intensive
computationally expensive
singular value decomposition
memory requirements
data mining
information retrieval
genetic algorithm
data structure
cost effective
efficient implementation
parallel implementation
memory usage