A Probabilistic Model for Clustering Text Documents with Multiple Fields.
Shanfeng Zhu, Ichigaku Takigawa, Shuqin Zhang, Hiroshi Mamitsuka
Published in: ECIR (2007)
Keyphrases
- text documents
- probabilistic model
- document clustering
- text mining
- text clustering
- topic models
- text categorization
- document classification
- information extraction
- text classification
- k means
- wordnet
- tf idf
- keywords
- clustering algorithm
- text collections
- text representation
- automatic text categorization
- comparative sentences
- bag of words
- unsupervised learning
- bayesian networks
- information retrieval
- clustering method
- data sets
- language model
- supervised learning
- text data
- active learning
- feature vectors
- image segmentation
- relevant concepts
- computer vision