A Comparative Study on Representing Units in Chinese Text Clustering.
Hongjun WangShiwen YuXueqiang LvShuicai ShiShibin XiaoPublished in: KSEM (2006)
Keyphrases
- text clustering
- text mining
- clustering algorithm
- document clustering
- k means
- hierarchical clustering
- text categorization
- background knowledge
- user feedback
- text data
- wordnet
- self organizing maps
- text documents
- text classification
- text collections
- vector space model
- document representation
- latent semantic analysis
- clustering quality
- neural network
- hierarchical structure
- natural language processing
- knn
- metric learning
- natural language
- information retrieval