Preprocessing Techniques in Text Categorization: A Survey.
Sayyam MalikSana Ahmad SaniAnees BaqirUsman AhmadFaizan ul MustafaPublished in: INTAP (2019)
Keyphrases
- text categorization
- preprocessing
- feature selection
- knn
- text classification
- multi label
- naive bayes
- document classification
- reuters corpus
- information gain
- text documents
- automated text categorization
- semi supervised learning
- k nearest neighbor
- feature weighting
- unlabeled data
- feature extraction
- tf idf
- text classifiers
- document categorization
- semantic browsing
- text collections
- term frequency
- automatic text categorization
- object recognition
- similarity measure
- decision trees
- neural network
- data sets