The Two Word Test: A Semantic Benchmark for Large Language Models.
Nicholas RiccardiRutvik H. DesaiPublished in: CoRR (2023)
Keyphrases
- language model
- n gram
- translation model
- language modeling
- probabilistic model
- multiword
- speech recognition
- language modelling
- test collection
- statistical language modeling
- out of vocabulary
- information retrieval
- word error rate
- document retrieval
- query expansion
- retrieval model
- word clouds
- vector space model
- co occurrence
- language independent
- retrieval effectiveness
- word pairs
- word segmentation
- context sensitive
- spoken term detection
- ad hoc information retrieval
- smoothing methods
- document ranking
- statistical language models
- language model for information retrieval
- relevance model
- term weighting
- semantic similarity
- semantic information
- natural language
- document level
- word sense disambiguation
- query terms
- hidden markov models