A Dataset and Strong Baselines for Classification of Czech News Texts.
Hynek KydlícekJindrich LibovickýPublished in: TSD (2023)
Keyphrases
- benchmark datasets
- classification accuracy
- decision trees
- feature extraction
- automatic classification
- text classification
- pattern recognition
- pattern classification
- automatically generated
- classification method
- classification algorithm
- classification scheme
- keywords
- support vector machine svm
- training samples
- uci datasets
- image classification
- short texts
- support vector machine
- training dataset
- support vector
- preprocessing
- broadcast news
- manually generated
- classification rules
- machine learning methods
- class labels
- machine learning algorithms
- model selection
- natural language processing
- supervised learning
- feature space