High Dimensional Clustering on Large Data Sets.
Alexander HinneburgPublished in: EDBT PhD Workshop (2000)
Keyphrases
- high dimensional
- high dimensional data sets
- high dimensional data
- high dimensionality
- data points
- clustering algorithm
- data sets
- cluster analysis
- clustering method
- k means
- dimensionality reduction
- hierarchical clustering
- multi dimensional
- low dimensional
- unsupervised learning
- sparse data
- similarity search
- high dimensional datasets
- distance metric
- neural network
- gene expression data
- categorical data
- data reduction
- nearest neighbor search
- outlier detection
- kernel function
- data analysis
- search engine
- information retrieval
- real world
- information theoretic
- gene expression
- nearest neighbor
- data clustering
- semi supervised
- input space
- dissimilarity measure
- support vector machine
- data mining
- high dimensional data space