Topic difference factor extraction between two document sets and its application to text categorization.
Takahiko Kawatani
Published in: SIGIR (2002)
Keyphrases
- text categorization
- document set
- term frequency
- text classification
- document clustering
- text documents
- relevant documents
- feature selection
- tf idf
- knn
- test collection
- document collections
- information extraction
- semi supervised learning
- k nearest neighbor
- keywords
- text summarization
- information retrieval
- document retrieval
- text data
- artificial intelligence