Semi-supervised Document Clustering with Simultaneous Text Representation and Categorization.
Yanhua ChenLijun WangMing DongPublished in: ECML/PKDD (1) (2009)
Keyphrases
- text representation
- text categorization
- document clustering
- text classification
- text documents
- information filtering
- concept learning
- index terms
- text clustering
- knn
- text mining
- bag of words
- keywords
- vector space model
- text retrieval
- feature selection
- k nearest neighbor
- document representation
- semi supervised learning
- text collections
- web documents
- language model
- learning process
- feature vectors
- information retrieval
- instance level constraints