World Knowledge as Indirect Supervision for Document Clustering.
Chenguang Wang, Yangqiu Song, Dan Roth, Ming Zhang, Jiawei Han
Published in: CoRR (2016)
Keyphrases
- document clustering
- world knowledge
- external knowledge
- bag of words
- text mining
- knowledge sources
- text documents
- background knowledge
- document collections
- clustering method
- clustering algorithm
- document representation
- active learning
- vector space model
- noun phrases
- tf-idf
- k-means
- semantic features
- computer vision
- cluster analysis
- knowledge discovery