Deep Structured Feature Networks for Table Detection and Tabular Data Extraction from Scanned Financial Document Images.
Siwen LuoMengting WuYiwen GongWanying ZhouJosiah PoonPublished in: CoRR (2021)
Keyphrases
- document images
- data extraction
- semi structured
- page segmentation
- document image analysis
- document analysis
- structured data
- line extraction
- text lines
- web data extraction
- scanned documents
- document image understanding
- web pages
- document processing
- data integration
- scanned document images
- optical character recognition
- printed documents
- database
- word spotting
- scanned images
- query interface
- information extraction
- historical documents
- database systems
- decision making
- machine learning
- databases
- page layout
- printed text
- document image retrieval
- data sets