Keyphrases
- text classification
- tf idf
- text categorization
- text documents
- term weighting
- term frequency
- document frequency
- feature weighting
- weighting scheme
- bag of words
- n gram
- feature selection
- text mining
- language modeling
- text data
- naive bayes
- labeled data
- multi label
- document clustering
- document representation
- text classifiers
- dimensionality reduction
- vector space
- retrieval model
- information retrieval
- machine learning
- ranking algorithm
- knn
- co occurrence
- unlabeled data
- k nearest neighbor
- distance measure
- semantic features
- information extraction
- data cleaning
- feature reduction