A term weighting scheme based on the measure of relevance and distinction for text categorization.
Jieming YangJing WangZhiying LiuZhaoyang QuPublished in: SNPD (2015)
Keyphrases
- text categorization
- term weighting schemes
- term weighting
- tf idf
- term frequency
- term weights
- text classification
- feature selection
- knn
- text documents
- information retrieval
- precision recall
- k nearest neighbor
- similarity measure
- correlation coefficient
- semi supervised learning
- unlabeled data
- cross domain
- weighting scheme
- test collection
- distance measure
- scoring function
- retrieved documents
- evaluation measures
- semi supervised
- vector space model
- retrieval effectiveness
- nearest neighbor
- labeled data
- web search