Evaluating Various Tokenizers for Arabic Text Classification.
Zaid AlyafeaiMaged Saeed AlShaibaniMustafa GhalebIrfan AhmadPublished in: CoRR (2021)
Keyphrases
- text classification
- bag of words
- text data
- text mining
- text categorization
- feature selection
- machine learning
- n gram
- labeled data
- naive bayes
- semantic features
- text classifiers
- arabic language
- neural network
- knn
- text documents
- document classification
- morphological analysis
- natural language processing
- sentiment analysis
- active learning
- sentiment classification