CCC: Chinese Commercial Contracts Dataset for Documents Layout Understanding.
Shu LiuYongnan JinHarry LuShangqing ZhaoMan LanYuefeng ChenHao YuanPublished in: NLPCC (2) (2023)
Keyphrases
- document collections
- information retrieval
- keyword extraction
- information retrieval systems
- relevant documents
- database
- metadata
- xml documents
- web documents
- user queries
- document retrieval
- document classification
- page layout
- document clustering
- structured documents
- chinese text
- vector space model
- text fragments
- multi document summarization
- text documents
- document image retrieval