Document Domain Randomization for Deep Learning Document Layout Extraction.
Meng LingJian ChenTorsten MöllerPetra IsenbergTobias IsenbergMichael SedlmairRobert S. LarameeHan-Wei ShenJian WuC. Lee GilesPublished in: ICDAR (1) (2021)
Keyphrases
- deep learning
- document layout
- structure extraction
- document images
- unsupervised learning
- document structure
- document analysis
- machine learning
- document image analysis
- weakly supervised
- mental models
- document retrieval
- information retrieval systems
- web documents
- relevant documents
- domain specific
- information retrieval
- image classification
- automatic extraction
- dimensionality reduction
- information extraction
- pattern recognition
- keywords
- image processing
- feature selection
- page segmentation
- computer vision