Semantically-Guided Clustering of Text Documents via Frequent Subgraphs Discovery.
Rafal A. AngrykMahmud Shahriar HossainBrandon NorickPublished in: ISMIS (2011)
Keyphrases
- text documents
- document clustering
- text mining
- text categorization
- text classification
- information extraction
- keywords
- frequent subgraphs
- wordnet
- clustering algorithm
- topic models
- graph mining
- clustering method
- knowledge discovery
- unsupervised learning
- k means
- bag of words
- k nearest neighbor
- data points
- semantic information
- named entities
- natural language
- real world
- co occurrence
- cluster analysis
- feature selection
- high dimensional data
- data analysis