Scan-to-XML: Automatic Generation of Browsable Technical Documents.
Ernest ValvenyBart LamiroyPublished in: ICPR (3) (2002)
Keyphrases
- xml documents
- xml format
- semi structured documents
- document structure
- metadata
- document centric
- extensible markup language
- xml data
- structured documents
- document repository
- semi structured
- information retrieval
- document collections
- document retrieval
- database
- xml queries
- content and structure
- semi structured data
- standard for data exchange
- xml databases
- free text
- relational databases
- data model
- xml schema
- information retrieval systems
- xml retrieval
- structured data
- electronic documents
- document management
- data exchange
- automatically generate
- retrieval systems
- object oriented
- document type
- xml files
- relevant documents
- web data
- data integration
- web documents
- query terms
- vector space model