Are n-gram Categories Helpful in Text Classification?
Jakub KruczekPaulina KruczekMarcin KutaPublished in: ICCS (2) (2020)
Keyphrases
- n gram
- text classification
- training documents
- bag of words
- language independent
- language modeling
- text categorization
- variable length
- language modelling
- naive bayes
- text mining
- automatic text classification
- feature selection
- machine learning
- language model
- viterbi algorithm
- text classifiers
- part of speech
- knn
- text data
- labeled data
- text documents
- term frequency
- cross lingual
- data mining
- multi label
- word segmentation
- statistical language modeling
- artificial intelligence
- inside outside algorithm
- semantic features
- unlabeled data
- co occurrence
- search engine