Using value-added document representations in INEX.
Birger LarsenHaakon LundJacob K. AndresenPeter IngwersenPublished in: INEX (2003)
Keyphrases
- document representation
- bag of words
- document collections
- vector representation
- document clustering
- semantically enhanced
- xml retrieval
- document retrieval
- web documents
- data fusion
- vector space model
- language model
- vector space
- text documents
- document structure
- semantic information
- anchor text
- structured documents
- image representation
- test collection
- computer vision
- text classification
- image classification
- probabilistic model
- prior knowledge
- feature extraction
- clustering algorithm