Retrieval by Layout Similarity of Documents Represented with MXY Trees.
Francesca Cesarini, Simone Marinai, Giovanni Soda
Published in: Document Analysis Systems (2002)
Keyphrases
- content similarity
- information retrieval systems
- information retrieval
- document retrieval
- retrieval systems
- text queries
- structured documents
- document image retrieval
- document indexing
- multimedia documents
- document analysis
- retrieval process
- text retrieval
- query terms
- document collections
- semantic similarity
- index terms
- document similarity
- similarity measure
- cosine similarity
- document content
- similar documents
- web documents
- heterogeneous collections
- similarity measurement
- document structure
- relevant documents
- query expansion
- xml documents
- documents retrieved
- expert finding
- retrieval strategies
- semantic content
- retrieved documents
- retrieval model
- page layout
- similarity retrieval
- test collection
- relevance feedback
- image database
- image retrieval
- document space
- vector space model
- document ranking
- effective retrieval
- related documents
- inter document similarities
- retrieval engine
- text collections
- document level
- language model
- search engine
- relevance assessments
- distributed information retrieval
- metadata
- decision trees
- text documents
- similarity estimation
- user queries
- multimedia
- digital libraries
- distance measure
- retrieval effectiveness
- ranked list
- content based retrieval
- term frequency
- boolean queries