An Augmentation Strategy for Visually Rich Documents.
Jing Xie, James B. Wendt, Yichao Zhou, Seth Ebner, Sandeep Tata
Published in: CoRR (2022)
Keyphrases
- information retrieval
- web documents
- xml documents
- document collections
- document classification
- metadata
- information retrieval systems
- document retrieval
- web data
- free text
- digital documents
- data sets
- plagiarism detection
- expert finding
- structured documents
- document clustering
- text documents
- selection strategy
- retrieval strategies
- web pages