Improving Multi-label Document Classification of Czech News Articles.
Jan Lehecka, Jan Svec
Published in: TSD (2015)
Keyphrases
- document classification
- news articles
- multi label
- text documents
- text categorization
- text classification
- statistical topic models
- knn
- feature selection
- text mining
- document clustering
- k nearest neighbor
- bag of words
- image classification
- graph cuts
- unlabeled data
- naive bayes
- machine learning
- semi supervised learning
- named entities
- labeled data
- class labels
- topic models
- model selection
- information extraction
- object recognition
- training data
- similarity measure
- image processing
- information retrieval