Scalable Model-Based Clustering for Large Databases Based on Data Summarization.
Huidong JinMan Leung WongKwong-Sak LeungPublished in: IEEE Trans. Pattern Anal. Mach. Intell. (2005)
Keyphrases
- data summarization
- model based clustering
- data mining
- mixture model
- data analysis
- hierarchical clustering
- em algorithm
- expectation maximization
- databases
- multidimensional data
- stream processing
- knowledge discovery
- bayesian information criterion
- document clustering
- data sets
- k means
- association rules
- vector space model
- gaussian mixture model
- machine learning
- similarity search
- high dimensional
- data processing
- dimensionality reduction
- text mining