VTLayout: Fusion of Visual and Text Features for Document Layout Analysis.
Shoubin LiXuyan MaShuaiqun PanJun HuLin ShiQing WangPublished in: CoRR (2021)
Keyphrases
- low level
- textual features
- textual content
- keywords
- automatically extracted
- information retrieval
- text documents
- document processing
- web documents
- semantic space
- document content
- text retrieval
- semantic information
- feature extraction
- digital documents
- data fusion
- lexical features
- information retrieval systems
- document retrieval
- multiple features
- structured documents
- multimedia documents
- document analysis
- visual appearance
- document structure
- automatic text classification
- text collections
- document representation
- document images
- visual features
- image classification
- search engine