Improved baselines for vision-language pre-training.
Enrico FiniPietro AstolfiAdriana Romero-SorianoJakob VerbeekMichal DrozdzalPublished in: CoRR (2023)
Keyphrases
- computer vision
- training set
- programming language
- vision system
- natural language
- training examples
- language processing
- language learning
- training samples
- image processing
- high level
- formal language
- real time
- english language
- online learning
- supervised learning
- knowledge representation
- training data
- information systems