CiT: Curation in Training for Effective Vision-Language Data.
Hu XuSaining XiePo-Yao HuangLicheng YuRussell HowesGargi GhoshLuke ZettlemoyerChristoph FeichtenhoferPublished in: CoRR (2023)
Keyphrases
- data collection
- high quality
- data sets
- data analysis
- database
- data processing
- knowledge discovery
- real time
- data distribution
- data mining techniques
- data points
- end users
- data structure
- image processing
- original data
- language learning
- vision system
- machine learning
- statistical analysis
- training dataset
- hidden markov models
- data quality
- raw data
- multimedia data
- databases
- high dimensional data
- bayesian networks
- programming language
- image data
- digital libraries
- relational databases