A new suffix tree similarity measure for document clustering.
Hung ChimXiaotie DengPublished in: WWW (2007)
Keyphrases
- document clustering
- suffix tree
- similarity measure
- clustering method
- cosine similarity
- data structure
- document collections
- clustering algorithm
- text mining
- pattern matching
- text documents
- index structure
- vector space model
- document representation
- distance measure
- document clusters
- pairwise
- document similarity
- similarity search
- k means
- multi dimensional
- semi supervised
- query processing
- databases