Document Domain Randomization for Deep Learning Document Layout Extraction.
Meng Ling, Jian Chen, Torsten Möller, Petra Isenberg, Tobias Isenberg, Michael Sedlmair, Robert S. Laramee, Han-Wei Shen, Jian Wu, C. Lee Giles
Published in: CoRR (2021)
Keyphrases
- deep learning
- document layout
- structure extraction
- document images
- document structure
- unsupervised learning
- document analysis
- page segmentation
- information retrieval systems
- document image analysis
- machine learning
- document retrieval
- weakly supervised
- web documents
- information retrieval
- document clustering
- retrieval systems
- generative model
- supervised learning
- information extraction