An Approach to Indexing and Clustering News Stories Using Continuous Language Models.
Richard Bache, Fabio Crestani
Published in: NLDB (2010)
Keyphrases
- language model
- news stories
- information retrieval
- language modeling
- news articles
- probabilistic model
- document retrieval
- query expansion
- retrieval model
- n-gram
- clustering algorithm
- clustering method
- smoothing methods
- test collection
- relevance model
- language models for information retrieval
- text retrieval
- query terms
- k-means
- pseudo-relevance feedback
- unsupervised learning
- vector space model
- multimedia
- news video
- out-of-vocabulary
- document clustering
- machine learning
- knowledge discovery
- document representation
- retrieved documents
- text mining
- query specific
- information retrieval systems