Official Document Identification and Data Extraction using Templates and OCR.
Cosmin IrimiaFlorin HarbuzariuIonut HaziAdrian IftenePublished in: KES (2022)
Keyphrases
- data extraction
- document images
- document processing
- semi structured
- html pages
- scanned documents
- web data extraction
- printed documents
- optical character recognition
- data integration
- document analysis
- information extraction
- web documents
- document image retrieval
- information retrieval
- query interface
- character recognition
- web pages
- document collections
- html documents
- retrieval systems
- digital libraries
- distributed databases
- relevant documents
- databases