PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents.
Junjie WangYin ZhangYatai JiYuxiang ZhangChunyang JiangYubo WangKang ZhuZekun WangTiezhen WangWenhao HuangJie FuBei ChenQunshu LinMinghao LiuGe ZhangWenhu ChenPublished in: CoRR (2024)
Keyphrases
- higher education
- knowledge intensive
- knowledge acquisition
- information retrieval
- document collections
- metadata
- law enforcement
- relevant documents
- information retrieval systems
- human resources
- knowledge management
- software development
- xml documents
- multi modal
- workflow management
- database
- vector space model
- web documents
- text documents
- document retrieval
- document clustering
- keywords
- knowledge workers
- law enforcement officers
- retrieval systems
- multimedia
- decision making
- artificial intelligence