Text Categorization Can Enhance Domain-Agnostic Stopword Extraction.
Houcemeddine TurkiNaome A. EtoriMohamed Ali Hadj TaiebAbdul-Hakeem OmotayoChris Chinenye EmezueMohamed Ben AouichaAyodele AwokoyaFalalu Ibrahim LawanDoreen NixdorfPublished in: CoRR (2024)
Keyphrases
- text categorization
- cross domain
- knn
- text classification
- multi label
- feature selection
- information gain
- k nearest neighbor
- feature weighting
- text documents
- semi supervised learning
- text classifiers
- automated text categorization
- reuters corpus
- information extraction
- document classification
- document categorization
- term weighting
- feature selection for text categorization
- feature selections
- data sets
- tf idf
- naive bayes
- text collections
- term frequency
- unlabeled data
- automatic text categorization
- semi supervised
- multi instance multi label learning
- support vector machine
- pairwise
- machine learning