Clustering high dimensional data: examining differences and commonalities between subspace clustering and text clustering - a position paper.
Hans-Peter KriegelEirini NtoutsiPublished in: SIGKDD Explor. (2013)
Keyphrases
- clustering high dimensional data
- text clustering
- subspace clustering
- clustering algorithm
- clustering method
- high dimensional data
- document clustering
- text data
- hierarchical clustering
- high dimensional
- k means
- text mining
- subspace clusters
- text categorization
- data clustering
- self organizing maps
- high dimensionality
- background knowledge
- clustering quality
- low dimensional
- wordnet
- data sets
- cluster analysis
- data mining
- machine learning
- data analysis
- feature space
- pairwise
- metric learning
- image data
- nearest neighbor
- natural language processing
- hierarchical structure
- text documents
- co occurrence
- dimensionality reduction
- generative model