Improving Semi-supervised Text Classification by Using Wikipedia Knowledge.
Zhilin Zhang, Huaizhong Lin, Pengfei Li, Huazhong Wang, Dongming Lu
Published in: WAIM (2013)
Keyphrases
- text classification
- semi-supervised
- labeled data
- semi-supervised learning
- unlabeled data
- knowledge discovery
- domain knowledge
- co-training
- text mining
- expert systems
- machine learning
- prior knowledge
- text data
- databases
- bag of words
- similarity measure
- knowledge sharing
- n-gram
- text categorization
- information retrieval
- feature selection
- active learning
- knowledge acquisition
- data mining techniques
- principal component analysis
- kNN