Parallel clustering of high-dimensional social media data streams.
Xiaoming GaoEmilio FerraraJudy QiuPublished in: CoRR (2015)
Keyphrases
- streaming data
- data streams
- social media
- high dimensional
- sliding window
- concept drift
- outlier detection
- clustering algorithm
- high dimensionality
- multiple data streams
- high dimensional datasets
- data points
- high dimensional data
- clustering method
- evolving data streams
- categorical data
- continuous data streams
- social networks
- data sets
- low dimensional
- high dimensional data sets
- unsupervised learning
- nearest neighbor
- stream data
- similarity search
- document clustering
- hierarchical clustering
- stream processing
- multi dimensional
- incoming data
- sensor networks
- categorical attributes
- fuzzy clustering
- big data
- feature space
- sensor data
- k means
- high dimensional data space
- dimensionality reduction
- parallel processing
- spectral clustering
- cluster analysis
- itemsets