Clustering by pattern similarity in large data sets.
Haixun Wang, Wei Wang, Jiong Yang, Philip S. Yu
Published in: SIGMOD Conference (2002)
Keyphrases
- similarity function
- clustering algorithm
- data sets
- similar patterns
- distance metric
- dissimilarity measure
- similarity calculation
- similarity measure
- clustering method
- measuring similarity
- high similarity
- k means
- cosine similarity
- data clustering
- distance measure
- pattern matching
- euclidean distance
- similarity assessment
- similarity computation
- multidimensional scaling
- structural similarity
- hierarchical clustering
- pattern extraction
- web sessions
- alternative clusterings
- fuzzy clustering
- similarity measurement
- cluster analysis
- data points
- website
- data reduction
- pattern discovery
- similar objects
- semantic similarity
- data objects
- document clustering
- information theoretic
- self organizing maps
- criterion function
- distance function
- content similarity
- data analysis