TiMix: Text-aware Image Mixing for Effective Vision-Language Pre-training.
Chaoya JiangWei YeHaiyang XuQinghao YeMing YanJi ZhangShikun ZhangPublished in: CoRR (2023)
Keyphrases
- image data
- image content
- single image
- image analysis
- image features
- image segmentation
- image classification
- input image
- similarity measure
- image retrieval
- edge detection
- low level image processing
- image collections
- image regions
- segmentation method
- programming language
- low level
- high resolution
- scanned documents
- information retrieval
- keywords
- image processing
- english text
- computer vision
- text retrieval
- web images
- handwritten documents
- text information
- language generation
- pixel values
- image set
- test images
- document images
- spatial information
- vision system
- image database
- training set