LLMCompass: Enabling Efficient Hardware Design for Large Language Model Inference.
Hengrui Zhang, August Ning, Rohan Baskar Prabhakar, David Wentzlaff
Published in: ISCA (2024)
Keyphrases
- language model
- hardware design
- language modeling
- n-gram
- document retrieval
- information retrieval
- speech recognition
- probabilistic model
- language modelling
- mixture model
- query expansion
- retrieval model
- hardware implementation
- statistical language models
- relevance model
- test collection
- generative model
- Bayesian inference
- co-occurrence
- pseudo-relevance feedback
- automatic speech recognition
- web search
- language model for information retrieval