Using cluster validation criterion to identify optimal feature subset and cluster number for document clustering.
Zheng-Yu NiuDong-Hong JiChew Lim TanPublished in: Inf. Process. Manag. (2007)
Keyphrases
- document clustering
- cluster validation
- feature subset
- cluster validity
- clustering algorithm
- feature selection
- clustering quality
- feature subset selection
- cluster analysis
- document clusters
- k means
- classification accuracy
- support vector machine
- document collections
- data sets
- association rules
- knowledge base
- neural network