Parallelizing Clustering of Geoscientific Data Sets using Data Streams.
Silvia NittelKelvin T. LeungPublished in: SSDBM (2004)
Keyphrases
- data streams
- data sets
- clustering algorithm
- multiple data streams
- outlier detection
- clustering method
- sliding window
- high dimensional data
- mixed data
- sensor networks
- unsupervised learning
- cluster analysis
- high dimensional data sets
- data clustering
- continuous queries
- k means
- self organizing maps
- information theoretic
- streaming data
- validity indices
- database
- categorical attributes
- real world data sets
- categorical data
- concept drift
- spectral clustering
- document clustering
- data distribution
- benchmark data sets
- fuzzy clustering
- stream data
- large scale data sets
- multi dimensional data
- input data
- continuous data streams
- data points
- training data