Document classification based on web search hit counts.
Masaya Kaneko, Shusuke Okamoto, Masaki Kohana, You Inayoshi
Published in: iiWAS (2012)
Keyphrases
- document classification
- web search
- text categorization
- search engine
- text classification
- text mining
- web documents
- search queries
- classification algorithm
- text documents
- linear classification
- web pages
- topic extraction
- automatic document classification
- data sets
- naive bayes
- k nearest neighbor
- information extraction
- bag of words
- knn
- classification accuracy
- feature selection
- computer vision
- databases