Co-clustering based classification for out-of-domain documents.
Wenyuan Dai, Gui-Rong Xue, Qiang Yang, Yong Yu
Published in: KDD (2007)
Keyphrases
- document classification
- classification accuracy
- information retrieval
- feature extraction
- support vector
- pattern recognition
- machine learning
- feature vectors
- domain specific
- automatic categorization
- domain independent
- classification algorithm
- supervised learning
- support vector machine (SVM)
- information retrieval systems
- document categorization
- classification scheme
- pre classified
- text classification
- feature selection
- model selection
- metadata
- automatic classification
- document clustering
- training set
- class labels
- benchmark datasets
- document collections
- training samples
- image classification
- web documents
- xml documents
- clustering method
- relevant documents
- text documents
- classification models
- digital libraries
- vector space model
- natural language processing
- keywords
- text classifiers
- decision trees