Sparse Fine-tuning for Inference Acceleration of Large Language Models.
Eldar KurticDenis KuznedelevElias FrantarMichael GoinDan AlistarhPublished in: CoRR (2023)
Keyphrases
- language model
- fine tuning
- language modeling
- n gram
- viable alternative
- document retrieval
- speech recognition
- probabilistic model
- retrieval model
- information retrieval
- query expansion
- fine tune
- language models for information retrieval
- test collection
- statistical language models
- ad hoc information retrieval
- vector space model
- query terms
- mixture model
- language modelling
- bayesian networks
- context sensitive
- smoothing methods
- okapi bm
- fine tuned
- document length
- relevance model
- term dependencies
- document ranking
- expert search
- translation model
- text retrieval
- high dimensional
- spoken term detection
- language model for information retrieval