Document Representation and Multilevel Measures of Document Similarity.
Irina MatveevaPublished in: HLT-NAACL (2006)
Keyphrases
- document representation
- document similarity
- bag of words
- document clustering
- vector space model
- document collections
- index terms
- data fusion
- semantic information
- language model
- text documents
- vector space
- web documents
- text classification
- evaluation measures
- information retrieval
- image representation
- action recognition
- background knowledge
- text categorization
- image classification
- principal component analysis
- object recognition