A Novel Feature Selection Approach Based on Document Frequency of Segmented Term Frequency.
Hongfang Zhou, Shuang Han, Yibin Liu
Published in: IEEE Access (2018)
Keyphrases
- document frequency
- term frequency
- feature selection
- text categorization
- information gain
- text classification
- tf-idf
- retrieval model
- average precision
- bag of words
- document representation
- text documents
- n-gram
- term weighting
- mutual information
- information retrieval
- retrieved documents
- support vector machine
- language model
- test collection
- vector space model
- semantic information
- vector space
- naive bayes
- query terms
- document collections
- labeled data
- information extraction
- decision trees
- high dimensional
- similarity measure