Modifying a Simplified EM Algorithm for Text Clustering using a SM based Operation on String Vectors.
Taeho JoMalrey LeeHyogun YoonChulgyu SongPublished in: DMIN (2007)
Keyphrases
- em algorithm
- text clustering
- expectation maximization
- text mining
- k means
- maximum likelihood
- mixture model
- document clustering
- clustering algorithm
- text categorization
- hierarchical clustering
- gaussian mixture model
- background knowledge
- generative model
- text documents
- text collections
- wordnet
- user feedback
- text data
- clustering quality
- self organizing maps
- metric learning
- probability density function
- density estimation
- information retrieval
- feature vectors
- clustering method
- topic models
- document collections
- input data
- feature selection