Experimenting N-Grams in Text Categorization.
Abdellatif RahmounZakaria ElberrichiPublished in: Int. Arab J. Inf. Technol. (2007)
Keyphrases
- text categorization
- n gram
- text classification
- bag of words
- language independent
- language model
- knn
- text documents
- k nearest neighbor
- feature selection
- language modeling
- naive bayes
- information gain
- part of speech
- text classifiers
- text mining
- machine learning
- term frequency
- viterbi algorithm
- tf idf
- unlabeled data
- semi supervised learning
- unsupervised learning
- cross language
- labeled data
- web documents
- learning algorithm