Revisiting Table Detection Datasets for Visually Rich Documents.
Bin Xiao, Murat Simsek, Burak Kantarci, Ala Abu Alkheir
Published in: CoRR (2023)
Keyphrases
- database
- information retrieval systems
- information retrieval
- document classification
- automatic detection
- detection accuracy
- object detection
- document retrieval
- document collections
- detection algorithm
- false alarms
- web documents
- text documents
- metadata
- vector space model
- benchmark datasets
- false positives
- event detection
- data collections
- detection rate
- real world
- text collections
- synthetic datasets
- plain text
- document clustering
- query terms
- text categorization
- multimedia
- data sets