Multimodal Tree Decoder for Table of Contents Extraction in Document Images.
Pengfei HuZhenrong ZhangJianshu ZhangJun DuJiajia WuPublished in: CoRR (2022)
Keyphrases
- document images
- table of contents
- line extraction
- document image analysis
- document analysis
- tree structure
- optical character recognition
- printed documents
- page layout
- document image understanding
- scanned documents
- database marketing
- sql server
- document image retrieval
- index structure
- historical documents
- page segmentation
- scanned document images
- oracle database
- b tree
- information extraction
- multimedia
- binary images
- data structure
- search engine
- databases