A Survey on Information Retrieval, Text Categorization, and Web Crawling
Youssef BassilPublished in: CoRR (2012)
Keyphrases
- text categorization
- web crawling
- information retrieval
- term weighting
- text collections
- tf idf
- feature selection
- text classification
- knn
- search engine
- k nearest neighbor
- information extraction
- text mining
- information retrieval systems
- text documents
- document collections
- document retrieval
- text classifiers
- language model
- data sets
- machine learning
- web search
- vector space model
- natural language processing
- term frequency
- nearest neighbor
- similarity measure
- real world