Retrieval of Article-pages, Using Geometric Layout Relationships among Document Components.
Satoshi YokoiHong YanToyohide WatanabePublished in: IIMSS (2013)
Keyphrases
- retrieval systems
- page layout
- information retrieval
- document retrieval
- cf loadingtexthtml
- information retrieval systems
- document image retrieval
- web documents
- structured documents
- multimedia documents
- search engine
- document layout
- retrieval engine
- document analysis
- keywords
- related documents
- document images
- website
- content similarity
- document collections
- retrieval model
- topic distillation
- document content
- query specific
- retrieval quality
- textual content
- relevant documents
- trec web
- image database
- effective retrieval
- web pages
- document ranking
- document structure
- index terms
- query expansion
- trec genomics
- document type
- image retrieval
- text retrieval
- digital libraries
- test collection
- relevance feedback
- content and structure
- query terms
- query independent
- retrieval strategies
- printed text
- scanned documents
- semantic content
- tf idf
- term frequency
- multimedia
- text documents
- link structure
- inter document similarities
- handwritten documents
- ad hoc retrieval