Hi-Fi HTML rendering of multi-format documents in DoMinUS.
Stefano FerilliFloriana EspositoDomenico RedavidPublished in: ACM Symposium on Document Engineering (2013)
Keyphrases
- extensible markup language
- metadata
- plain text
- xml format
- electronic documents
- document type
- human readable
- file formats
- relevant documents
- web documents
- document structure
- pdf documents
- high quality
- xml documents
- information retrieval
- document collections
- xml files
- structured documents
- document classification
- real time
- web pages
- digital libraries
- information retrieval systems
- pdf files
- web browser
- d scene
- legal documents
- document clustering
- free text
- user interface
- computer graphics
- high fidelity
- html documents
- database
- data interchange
- information extraction
- ranked list
- vector space
- text documents
- user queries
- text mining
- document representation
- test collection
- digital documents
- keywords
- html pages
- query terms
- multimedia
- machine learning
- retrieval systems