Frequency Explains the Inverse Correlation of Large Language Models' Size, Training Data Amount, and Surprisal's Fit to Reading Times.
Byung-Doh Oh, Shisen Yue, William Schuler
Published in: CoRR (2024)
Keyphrases
- language model
- training data
- language modeling
- n-gram
- document retrieval
- probabilistic model
- information retrieval
- retrieval model
- query expansion
- context sensitive
- speech recognition
- language modelling
- decision trees
- statistical language models
- test collection
- smoothing methods
- ad hoc information retrieval
- vector space model
- learning algorithm
- document ranking
- query terms
- classification accuracy
- term dependencies
- translation model
- language models for information retrieval
- query specific
- training set