Model based table cell detection and content extraction from degraded document images.
Zhixin Shi, Srirangaraj Setlur, Venu Govindaraju
Published in: DAR@ICVGIP (2012)
Keyphrases
- document images
- content extraction
- HTML documents
- OCR systems
- document analysis
- document image analysis
- optical character recognition
- text content
- page segmentation
- digital archives
- database
- historical documents
- scanned documents
- document image retrieval
- page layout
- automatic extraction
- web documents
- printed documents
- multimedia information retrieval
- semantic information
- word spotting