Discriminative Category Matching: Efficient Text Classification for Huge Document Collections.
Gabriel Pui Cheong FungJeffrey Xu YuHongjun LuPublished in: ICDM (2002)
Keyphrases
- document collections
- text classification
- text data
- information retrieval systems
- document retrieval
- topic detection
- feature selection
- information retrieval
- text retrieval
- text categorization
- data collections
- text collections
- test collection
- digital libraries
- bag of words
- scatter gather
- document clustering
- text corpora
- document representation
- document archives
- relevant documents
- n gram
- unlabeled data
- labeled data
- information extraction
- data analysis
- learning algorithm