Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved with Text.
Wanrong ZhuJack HesselAnas AwadallaSamir Yitzhak GadreJesse DodgeAlex FangYoungjae YuLudwig SchmidtWilliam Yang WangYejin ChoiPublished in: NeurIPS (2023)
Keyphrases
- image analysis
- image data
- input image
- image collections
- image retrieval
- ground truth
- image registration
- image database
- three dimensional
- web images
- image classification
- image features
- object recognition
- text detection
- broad coverage
- edge detection
- text information
- textual information
- image set
- historical manuscripts
- multi modal
- text mining
- image annotation
- low level features
- image search
- text data
- image regions
- natural language text
- feature points
- text corpus
- co occurrence
- newspaper articles
- multiple modalities
- information retrieval