CiT: Curation in Training for Effective Vision-Language Data.
Hu XuSaining XiePo-Yao HuangLicheng YuRussell HowesGargi GhoshLuke ZettlemoyerChristoph FeichtenhoferPublished in: ICCV (2023)
Keyphrases
- high quality
- data sets
- data analysis
- data collection
- raw data
- natural language
- data sources
- data processing
- data quality
- complex data
- test data
- experimental data
- synthetic data
- database
- training samples
- training process
- programming language
- labeled data
- input data
- image data
- data points
- high dimensional
- data structure
- decision trees
- databases
- real time