Enhancing text clustering by leveraging Wikipedia semantics.
Jian HuLujun FangYang CaoHua-Jun ZengHua LiQiang YangZheng ChenPublished in: SIGIR (2008)
Keyphrases
- text clustering
- wordnet
- semantic information
- document representation
- text mining
- semantic relations
- hierarchical clustering
- document clustering
- background knowledge
- document collections
- text categorization
- clustering algorithm
- text data
- text collections
- user feedback
- text documents
- knowledge base
- self organizing maps
- named entities
- metric learning
- natural language processing
- vector space model
- co occurrence
- collaborative filtering
- k means
- feature selection
- information retrieval
- machine learning
- search engine
- dimensionality reduction