Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text.
Wanrong Zhu, Jack Hessel, Anas Awadalla, Samir Yitzhak Gadre, Jesse Dodge, Alex Fang, Youngjae Yu, Ludwig Schmidt, William Yang Wang, Yejin Choi
Published in: CoRR (2023)
Keyphrases
- image data
- image database
- image retrieval
- textual information
- object recognition
- image analysis
- image features
- multiple modalities
- test images
- text retrieval
- input image
- three-dimensional
- image classification
- image collections
- ground truth
- image structure
- image processing
- complex background
- text mining
- keywords
- information retrieval
- segmentation algorithm
- multi modal
- image set
- scale space
- newspaper articles
- text information
- sentence level
- English words
- web images
- text data
- image annotation
- text documents
- image matching
- semantic information