A Survey on Image-text Multimodal Models.
Ruifeng GuoJingxuan WeiLinzhuang SunBihui YuGuiyong ChangDawei LiuSibo ZhangZhengbing YaoMingjun XuLiping BuPublished in: CoRR (2023)
Keyphrases
- input image
- image data
- image analysis
- image classification
- image content
- single image
- image retrieval
- image statistics
- image features
- multiscale
- image representation
- high resolution
- low level
- image pixels
- visual effects
- information retrieval
- statistical model
- text retrieval
- template matching
- test images
- image segmentation
- spatial information
- feature points
- bayesian framework
- probabilistic model
- natural images
- textual information
- web images
- feature vectors
- text information
- keywords