Structural extraction from visual layout of documents.
Binyamin RosenfeldRonen FeldmanYonatan AumannPublished in: CIKM (2002)
Keyphrases
- information retrieval
- page layout
- document collections
- information extraction
- free text
- information retrieval systems
- visual features
- visual information
- xml documents
- text documents
- metadata
- legal documents
- semantic content
- structured documents
- web documents
- keywords
- digital libraries
- multimedia documents
- structural analysis
- document analysis
- automatic extraction
- structural information
- document retrieval
- document classification
- electronic documents
- spatial layout
- user queries
- query terms
- relevant documents
- document image retrieval
- website
- document set
- latent semantic analysis
- automatically extracted
- web images
- knowledge extraction
- document representation
- vector space model
- vector space
- retrieval systems