Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data.
Haolong Li, Yu Ma, Yinqi Zhang, Chen Ye, Jie Chen
Published in: CoRR (2024)
Keyphrases
- synthetic data
- language model
- language modeling
- document retrieval
- n gram
- language modelling
- query expansion
- probabilistic model
- retrieval model
- speech recognition
- information retrieval
- real world
- test collection
- data sets
- statistical language models
- ad hoc information retrieval
- real image data
- smoothing methods
- language model for information retrieval
- context sensitive
- query terms
- pseudo relevance feedback
- vector space model
- database
- translation model
- query specific
- term dependencies
- synthetic datasets
- speech signal
- document length
- okapi bm
- language modeling approaches
- spoken term detection