Keyphrases
- document representation
- bag of words
- document collections
- document clustering
- vector space model
- language model
- data fusion
- web documents
- vector space
- text documents
- semantic information
- unsupervised learning
- supervised learning
- document categorization
- text data
- machine learning
- document content
- information retrieval systems
- semi-supervised
- digital libraries
- background knowledge
- image representation
- image classification
- WordNet
- text classification
- active learning
- high-dimensional