Topical N-Grams: Phrase and Topic Discovery, with an Application to Information Retrieval.
Xuerui Wang, Andrew McCallum, Xing Wei
Published in: ICDM (2007)
Keyphrases
- n gram
- topic discovery
- text classification
- information retrieval
- language model
- topic models
- language modeling
- document level
- topic modeling
- text mining
- text analysis
- latent dirichlet allocation
- language independent
- bag of words
- keywords
- text documents
- relevance ranking
- document retrieval
- search engine
- test collection
- word level
- query expansion
- retrieval model
- variable length
- probabilistic model
- information extraction
- part of speech
- lda model
- knn
- web documents
- document collections
- machine learning
- information retrieval systems
- artificial intelligence
- feature selection
- data analysis
- text categorization
- information access
- digital libraries
- natural language
- text retrieval
- relevant documents