Clustering of Short Strings in Large Databases.
Michail KazimianecArturas MazeikaPublished in: DEXA Workshops (2009)
Keyphrases
- data mining
- data mining tasks
- knowledge discovery
- cluster analysis
- clustering algorithm
- outlier detection
- categorical data
- databases
- clustering method
- self organizing maps
- data clustering
- unsupervised learning
- hierarchical clustering
- k means
- data analysis
- high dimensional data
- document clustering
- data objects
- spectral clustering
- anomaly detection
- information theoretic
- nearest neighbor
- data points
- edit distance
- cluster centers