Topic Models For Feature Selection in Document Clustering.
Anna DrummondChris JermaineZografoula VagenaPublished in: SDM (2013)
Keyphrases
- document clustering
- topic models
- text documents
- feature selection
- text mining
- topic modeling
- latent dirichlet allocation
- text classification
- text categorization
- document representation
- tf idf
- negative matrix factorization
- co occurrence
- text analysis
- probabilistic model
- machine learning
- relevance model
- vector space model
- generative model
- natural language processing
- data mining
- artificial intelligence
- feature extraction
- document collections
- model selection
- knowledge discovery
- k means
- clustering method
- feature space
- news articles
- information retrieval
- support vector machine
- unsupervised learning
- object recognition
- support vector
- prior knowledge
- clustering algorithm
- information extraction