MECLA: Memory-Compute-Efficient LLM Accelerator with Scaling Sub-matrix Partition.

Published in: ISCA (2024)

Keyphrases