A Semi-automatic Adaptive OCR for Digital Libraries.
Sachin RawatK. S. Sesh KumarMillion MesheshaIndraneel Deb SikdarA. BalasubramanianC. V. JawaharPublished in: Document Analysis Systems (2006)
Keyphrases
- semi automatic
- digital libraries
- document image analysis
- fully automatic
- document processing
- domain ontology
- gold standard
- semi automatically
- document images
- metadata
- optical character recognition
- preprocessing
- advanced technology
- semantic annotation
- post processing
- multimedia
- ontology mapping
- design rationale
- ontology construction
- error correction
- wrapper generation
- ontology engineering
- manual annotation
- knowledge extraction
- information retrieval
- natural language
- search engine