Using Shape Patterns for Clustering Unstructured Text Documents.
Durga Toshniwal, Rishiraj Saha Roy
Published in: SEDE (2009)
Keyphrases
- text documents
- document clustering
- text clustering
- text mining
- text classification
- text categorization
- information extraction
- keywords
- WordNet
- clustering algorithm
- extraction patterns
- text data
- bag of words
- named entities
- clustering method
- k-means
- topic models
- text representation
- unsupervised learning
- text collections
- pattern discovery
- neural network
- pattern mining
- sequential patterns
- structured data
- semi-supervised learning
- textual data
- data points
- data analysis
- image processing
- information extraction systems
- databases
- automatic text categorization