A Solution of the Multiaspect Text Categorization Problem by a Hybrid HMM and LDA Based Technique.
Slawomir Zadrozny, Janusz Kacprzyk, Marek Gajewski
Published in: IPMU (1) (2016)
Keyphrases
- text categorization
- multi label
- text classification
- knn
- feature selection
- hidden markov models
- k nearest neighbor
- information gain
- feature weighting
- naive bayes
- automatic text categorization
- reuters corpus
- text documents
- document categorization
- automated text categorization
- tf idf
- term frequency
- text classifiers
- unlabeled data
- feature selection and classifier
- distributional clustering
- feature selection for text categorization
- document frequency
- term weighting
- text collections
- semi supervised
- decision trees