Multinomial mixture model with feature selection for text clustering.
Minqiang LiLiang ZhangPublished in: Knowl. Based Syst. (2008)
Keyphrases
- text clustering
- mixture model
- text categorization
- feature selection
- em algorithm
- text classification
- expectation maximization
- text data
- probabilistic model
- model selection
- dirichlet distribution
- unsupervised learning
- text documents
- gaussian mixture model
- k means
- naive bayes
- generative model
- text mining
- maximum likelihood
- density estimation
- document clustering
- clustering algorithm
- knn
- machine learning
- semi supervised learning
- background knowledge
- language model
- probability density function
- hierarchical clustering
- data mining
- image processing
- bag of words
- dimensionality reduction
- high dimensional
- metric learning
- document representation
- discrete data
- multi class
- feature extraction
- information extraction
- data analysis
- tf idf
- clustering quality
- image segmentation
- reinforcement learning