Dictionary-based text categorization of chemical web pages.
Chunyan Liang, Li Guo, Zhaojie Xia, Feng-Guang Nie, Xiaoxia Li, Liang Su, Zhangyuan Yang
Published in: Inf. Process. Manag. (2006)
Keyphrases
- text categorization
- web pages
- website
- search engine
- feature selection
- text classification
- knn
- multi label
- k nearest neighbor
- text documents
- information gain
- web content
- naive bayes
- web documents
- web search engines
- reuters corpus
- web search
- text classifiers
- cross language information retrieval
- automated text categorization
- automatic text categorization
- keywords
- tf idf
- term frequency
- semi supervised learning
- feature selections
- data sets
- information theoretic
- text mining
- user intent
- multi instance multi label learning