Improving Arabic Text Categorization Using Transformer Training Diversification.
Shammur Absar ChowdhuryAhmed AbdelaliKareem DarwishSoon-Gyo JungJoni SalminenBernard J. JansenPublished in: WANLP@COLING (2020)
Keyphrases
- text categorization
- linear svm
- text classifiers
- feature selection
- text classification
- feature selection and classifier
- knn
- multi label
- k nearest neighbor
- naive bayes
- text documents
- reuters corpus
- automated text categorization
- automatic text categorization
- feature weighting
- text collections
- information gain
- document categorization
- semi supervised learning
- term weighting
- supervised learning
- feature selection for text categorization
- semantic browsing
- tf idf
- training documents
- document frequency
- data sets
- training set
- neural network