Dictionary-Based Voting Text Categorization in a Chemistry-Focused Search Engine.
Chunyan LiangLi GuoZhaojie XiaXiaoxia LiZhangyuan YangPublished in: WISE (2005)
Keyphrases
- text categorization
- search engine
- feature selection
- text classification
- multi label
- knn
- information gain
- web search engines
- semi supervised learning
- automated text categorization
- web search
- k nearest neighbor
- keywords
- naive bayes
- information retrieval
- reuters corpus
- user queries
- text documents
- text mining
- automatic text categorization
- external knowledge
- tf idf
- cross language
- unlabeled data
- machine learning
- term frequency
- text classifiers
- similarity measure
- feature vectors
- data mining