Using Wikipedia categories for compact representations of chemical documents.
Benjamin KöhnckeWolf-Tilo BalkePublished in: CIKM (2010)
Keyphrases
- compact representations
- information retrieval
- automatic text categorization
- training documents
- document collections
- document classification
- document retrieval
- metadata
- xml documents
- information retrieval systems
- text documents
- wikipedia pages
- free text
- web documents
- classify documents
- vector space
- document clustering
- probabilistic inference
- relevant documents
- web directories
- news items
- clustering algorithm
- vector space model
- document representation
- text classifiers
- drug discovery
- semantic information
- keywords