Embers of Autoregression: Understanding Large Language Models Through the Problem They are Trained to Solve.
R. Thomas McCoyShunyu YaoDan FriedmanMatthew HardyThomas L. GriffithsPublished in: CoRR (2023)
Keyphrases
- language model
- language modeling
- n gram
- speech recognition
- probabilistic model
- document retrieval
- information retrieval
- retrieval model
- ad hoc information retrieval
- test collection
- query expansion
- statistical language models
- query terms
- smoothing methods
- language modelling
- context sensitive
- web search
- pseudo relevance feedback
- training set
- language models for information retrieval
- machine learning
- document ranking
- term dependencies