Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters.
Weizhi WangKhalil MriniLinjie YangSateesh KumarYu TianXifeng YanHeng WangPublished in: CoRR (2024)
Keyphrases
- language model
- text data
- language modeling
- image data
- text classification
- image features
- input image
- text mining
- n gram
- probabilistic model
- document representation
- multiscale
- test collection
- information retrieval
- retrieval model
- image classification
- high dimensional
- image representation
- document retrieval
- image retrieval
- semi supervised
- information extraction
- unsupervised learning
- data model
- feature vectors
- bag of words
- multimedia
- data mining
- smoothing methods