A SOM-Based Document Clustering Using Frequent Max Substrings for Non-Segmented Texts.
Todsanai Chumwatana, Kok Wai Wong, Hong Xie
Published in: J. Intell. Learn. Syst. Appl. (2010)
Keyphrases
- document clustering
- text documents
- k-means
- terminology extraction
- self-organizing maps
- clustering algorithm
- text mining
- cluster analysis
- non-negative matrix factorization
- document collections
- clustering method
- document representation
- topic extraction
- topic detection
- document clusters
- neural network
- document similarity
- text classification
- keywords
- bag of words
- information extraction
- document classification
- text categorization
- WordNet
- topic models
- ant-based clustering
- tolerance rough set
- unsupervised learning
- high dimensional
- clustering approaches
- supervised learning
- feature selection
- information retrieval