Text Clustering Using a Suffix Tree Similarity Measure.
Chenghui HuangJian YinFang HouPublished in: J. Comput. (2011)
Keyphrases
- text clustering
- suffix tree
- similarity measure
- data structure
- text mining
- document clustering
- hierarchical clustering
- text data
- clustering method
- clustering algorithm
- text categorization
- background knowledge
- pattern matching
- k means
- metric learning
- index structure
- pairwise
- text collections
- self organizing maps
- euclidean distance
- similarity search
- text documents
- wordnet
- inverted index
- data sets
- semantic similarity
- document representation
- document collections
- text classification
- information extraction
- information retrieval
- data mining