Classifying Documents Without Labels.
Daniel BarbaráCarlotta DomeniconiNing KangPublished in: SDM (2004)
Keyphrases
- document retrieval
- document collections
- information retrieval
- document clustering
- document classification
- relevant documents
- text documents
- information retrieval systems
- pre classified
- legal documents
- document content
- document representation
- web documents
- retrieval systems
- supervised learning
- machine learning
- xml documents
- training examples
- structured documents
- user queries
- text categorization
- training documents
- semantic classes
- document set
- automatic classification
- vector space model
- metadata
- keywords
- vector space
- multi label