Automatic Extraction of Domain-Specific Stopwords from Labeled Documents.
Masoud MakrehchiMohamed S. KamelPublished in: ECIR (2008)
Keyphrases
- automatic extraction
- labeled documents
- domain specific
- stop words
- text classification
- text classifiers
- preprocessing step
- tf idf
- training data
- text categorization
- information gain
- text documents
- text mining
- naive bayes
- document classification
- dimensionality reduction
- term frequency
- keywords
- search engine
- labeled data
- semi supervised learning
- machine learning
- bag of words
- knn
- knowledge base