The impact of preprocessing on text classification.
Alper Kursat UysalSerkan GünalPublished in: Inf. Process. Manag. (2014)
Keyphrases
- text classification
- preprocessing
- text mining
- bag of words
- text categorization
- n gram
- machine learning
- text documents
- naive bayes
- unlabeled data
- preprocessing phase
- semantic features
- text data
- feature selection
- post processing
- labeled data
- k nearest neighbor
- multi label
- feature extraction
- sentiment analysis
- text classifiers
- data cleaning
- databases
- action recognition
- training data
- document classification
- data mining