ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data.
Yufan ShenChuwei LuoZhaoqing ZhuYang ChenQi ZhengZhi YuJiajun BuCong YaoPublished in: CoRR (2024)
Keyphrases
- data sets
- database
- data analysis
- data collection
- synthetic data
- data processing
- computer systems
- raw data
- probability distribution
- image data
- high quality
- statistical analysis
- redundant data
- document images
- sensor data
- knowledge discovery
- data mining
- data mining techniques
- missing data
- test collection
- process model
- spatial data
- end users
- data sources
- data quality
- prior knowledge