A cross-collection mixture model for comparative text mining.
ChengXiang ZhaiAtulya VelivelliBei YuPublished in: KDD (2004)
Keyphrases
- mixture model
- text mining
- em algorithm
- gaussian mixture model
- density estimation
- generative model
- model selection
- probabilistic model
- expectation maximization
- information extraction
- language model
- natural language processing
- unsupervised learning
- biomedical literature
- machine learning
- text documents
- maximum likelihood
- text classification
- data analysis
- probability density function
- information retrieval
- mixture modeling
- knowledge discovery
- topic models
- data sets
- document collections
- image segmentation
- gaussian process
- finite mixture model
- dirichlet processes
- probabilistic mixture model