CLIP the Bias: How Useful is Balancing Data in Multimodal Learning?
Ibrahim AlabdulmohsinXiao WangAndreas SteinerPriya GoyalAlexander D'AmourXiaohua ZhaiPublished in: CoRR (2024)
Keyphrases
- data sets
- image data
- background knowledge
- data processing
- data collection
- knowledge discovery
- learning algorithm
- data quality
- synthetic data
- small number
- raw data
- prior knowledge
- data analysis
- input data
- training data
- database
- data distribution
- knowledge acquisition
- supervised learning
- learning process
- data sources
- multi modal
- human experts
- network structure
- hidden variables
- learning systems
- statistical analysis
- computer systems
- probability distribution
- data model