Transformer-based Causal Language Models Perform Clustering.
Xinbo WuLav R. VarshneyPublished in: CoRR (2024)
Keyphrases
- language model
- language modeling
- probabilistic model
- document retrieval
- n gram
- clustering algorithm
- speech recognition
- clustering method
- retrieval model
- language modelling
- information retrieval
- pseudo relevance feedback
- query expansion
- k means
- context sensitive
- mixture model
- smoothing methods
- statistical language models
- spectral clustering
- query terms
- document ranking
- query specific
- bayesian networks
- ad hoc information retrieval
- data clustering
- vector space model
- test collection
- word error rate
- hierarchical clustering
- retrieval effectiveness
- document clustering
- agglomerative clustering
- language modeling framework
- language model for information retrieval