A variant of n-gram based language-independent text categorization.
Jelena Graovac
Published in: Intell. Data Anal. (2014)
Keyphrases
- language independent
- text categorization
- n gram
- text classification
- cross language
- feature selection
- knn
- text documents
- bag of words
- language model
- machine learning
- text mining
- tf idf
- automatic summarization
- text classifiers
- term frequency
- text retrieval
- transfer learning
- k nearest neighbor
- language modeling
- labeled data
- unsupervised learning
- clustering method
- learning algorithm