Understanding the Multi-modal Prompts of the Pre-trained Vision-Language Model.
Shuailei Ma, Chen-Wei Xie, Ying Wei, Siyang Sun, Jiaqi Fan, Xiaoyi Bao, Yuxin Guo, Yun Zheng
Published in: CoRR (2023)
Keyphrases
- multi-modal
- language model
- pre-trained
- language modeling
- n-gram
- speech recognition
- probabilistic model
- information retrieval
- mixture model
- retrieval model
- training data
- test collection
- computer vision
- query expansion
- smoothing methods
- high dimensional
- audio visual
- training examples
- image annotation
- translation model
- machine learning
- information retrieval systems
- dimensionality reduction