Language Independent n-Gram-Based Text Categorization with Weighting Factors: A Case Study.
Jelena Graovac, Jovana J. Kovacevic, Gordana Pavlovic-Lazetic
Published in: J. Inf. Data Manag. (2015)
Keyphrases
- text categorization
- language independent
- weighting factors
- n gram
- text classification
- cross language
- bag of words
- feature selection
- knn
- language model
- term frequency
- text documents
- language modeling
- text retrieval
- k nearest neighbor
- visual words
- machine translation
- text mining
- labeled data
- machine learning
- cross lingual
- tf idf
- transfer learning
- neural network
- search engine
- unlabeled data
- semi supervised learning
- artificial intelligence
- text classifiers
- digital libraries