Parallel Clustering of High-Dimensional Social Media Data Streams.
Xiaoming GaoEmilio FerraraJudy QiuPublished in: CCGRID (2015)
Keyphrases
- data streams
- high dimensional
- social media
- high dimensional data
- clustering algorithm
- data points
- sliding window
- high dimensionality
- k means
- sensor networks
- streaming data
- clustering method
- multiple data streams
- low dimensional
- similarity search
- outlier detection
- nearest neighbor
- dimensionality reduction
- multi dimensional
- data clustering
- self organizing maps
- fuzzy clustering
- high dimensional datasets
- unsupervised learning
- multi dimensional data
- data sets
- continuous data
- stream processing
- anytime classification
- categorical data
- big data
- microarray data
- document clustering
- gene expression data
- knn
- feature space
- feature selection