Multi-Type-TD-TSR - Extracting Tables from Document Images Using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: From OCR to Structured Table Representations.
Pascal FischerAlen SmajicGiuseppe AbramiAlexander MehlerPublished in: KI (2021)
Keyphrases
- document images
- multistage
- document image analysis
- database
- optical character recognition
- document analysis
- text lines
- ocr systems
- machine learning
- reinforcement learning
- scanned documents
- character recognition
- page layout
- multi type
- page segmentation
- dynamic programming
- object recognition
- databases
- lot sizing
- relational data
- structured data
- handwriting recognition
- printed documents
- search space
- relational databases
- real world