Automatically generating high quality metadata by analyzing the document code of common file types.
Lars Fredrik Høimyr EdvardsenIngeborg SølvbergTrond AalbergHallvard TrættebergPublished in: JCDL (2009)
Keyphrases
- automatically generating
- high quality
- metadata
- automatically generated
- digital documents
- database
- digital libraries
- multimedia documents
- document classification
- electronic documents
- document images
- information retrieval
- web documents
- information retrieval systems
- source code
- document collections
- semantic information
- retrieval systems
- data warehouse
- english words
- multimedia
- low quality
- signature file
- dublin core
- short list
- semantic content
- relevant documents
- learning resources
- test collection
- high resolution
- keywords
- databases