Effects of term weighting approach with and without stop words removing on Arabic text classification.
Esra'a AlhenawiRuba Abu KhurmaPedro A. CastilloMaribel García ArenasPublished in: CoRR (2024)
Keyphrases
- term weighting
- text classification
- stop words
- tf idf
- text categorization
- term frequency
- text documents
- language modeling
- bag of words
- information gain
- feature selection
- n gram
- text mining
- information retrieval
- text data
- text retrieval
- machine learning
- naive bayes
- knn
- unlabeled data
- document frequency
- labeled data
- document clustering
- data mining
- vector space model
- ranking algorithm
- retrieval model
- language model
- nearest neighbor
- semi supervised learning
- bayesian networks
- similarity measure
- multimedia
- retrieval systems
- neural network