RanLayNet: A Dataset for Document Layout Detection used for Domain Adaptation and Generalization.
Avinash AnandRaj JaiswalMohit GuptaSiddhesh S. BangarPijush BhuyanNaman LalRajeev SinghRitika JhaRajiv Ratn ShahShin'ichi SatohPublished in: CoRR (2024)
Keyphrases
- domain adaptation
- document layout
- structure extraction
- labeled data
- cross domain
- document classification
- semi supervised
- semi supervised learning
- sentiment classification
- transfer learning
- target domain
- multiple sources
- test data
- page segmentation
- document analysis
- data sets
- pairwise
- co training
- unlabeled data
- wordnet
- active learning
- relational databases
- machine learning