Login / Signup

Memory Is All You Need: An Overview of Compute-in-Memory Architectures for Accelerating Large Language Model Inference.

Christopher WoltersXiaoxuan YangUlf SchlichtmannToyotaro Suzumura
Published in: CoRR (2024)
Keyphrases