Research on Improvement of N-grams Based Text Classification by Applying Pointwise Mutual Information Measures.
Tsvetanka Georgieva-Trifonova
Published in: Balt. J. Mod. Comput. (2021)
Keyphrases
- n gram
- text classification
- pointwise mutual information
- bag of words
- feature selection
- language independent
- machine learning
- language modeling
- text categorization
- variable length
- language modelling
- language model
- labeled data
- part of speech
- text mining
- viterbi algorithm
- unlabeled data
- naive bayes
- knn
- text documents
- relevance ranking
- inside outside algorithm
- text classifiers
- cross lingual
- computer vision