CED: Catalog Extraction from Documents.
Tong ZhuGuoliang ZhangZechang LiZijian YuJunfei RenMengsong WuZhefeng WangBaoxing HuaiPingfu ChaoWenliang ChenPublished in: ICDAR (3) (2023)
Keyphrases
- web documents
- information retrieval
- document collections
- information retrieval systems
- text documents
- metadata
- relevant documents
- xml documents
- document retrieval
- document clustering
- digital documents
- information extraction
- image processing
- vector space model
- textual content
- legal documents
- user queries
- keywords
- document representation
- probabilistic model
- automatically extracted
- document content