A Benchmark for Structured Extractions from Complex Documents.
Zilong Wang, Yichao Zhou, Wei Wei, Chen-Yu Lee, Sandeep Tata
Published in: CoRR (2022)
Keyphrases
- real world
- web documents
- information retrieval
- metadata
- complex data
- neural network
- information retrieval systems
- resource intensive
- document classification
- complex systems
- structured data
- document collections
- xml documents
- keywords
- text classification
- user queries
- retrieval systems
- text documents
- document retrieval
- document clustering
- document analysis
- database