Login / Signup
Memory Is All You Need: An Overview of Compute-in-Memory Architectures for Accelerating Large Language Model Inference.
Christopher Wolters
Xiaoxuan Yang
Ulf Schlichtmann
Toyotaro Suzumura
Published in:
CoRR (2024)
Keyphrases
</>
language model
language modeling
probabilistic model
document retrieval
n gram
retrieval model
information retrieval
speech recognition
test collection
statistical language models
context sensitive
bayesian networks
graphical models
relevance model