Hierarchical vs. flat n-gram-based text categorization: Can we do better?
Jelena GraovacJovana J. KovacevicGordana Pavlovic-LazeticPublished in: Comput. Sci. Inf. Syst. (2017)
Keyphrases
- text categorization
- hierarchical text categorization
- text classification
- knn
- feature selection
- multi label
- n gram
- text classifiers
- k nearest neighbor
- information gain
- document categorization
- reuters corpus
- semi supervised learning
- automated text categorization
- text documents
- naive bayes
- cross language
- feature weighting
- feature selection for text categorization
- machine learning
- tf idf
- information retrieval systems
- term selection
- prior knowledge
- data analysis
- feature selections
- data sets