Unaligning Everything: Or Aligning Any Text to Any Image in Multimodal Models.
Shaeke SalmanMd Montasir Bin ShamsXiuwen LiuPublished in: CoRR (2024)
Keyphrases
- image features
- input image
- image data
- image content
- multiscale
- single image
- bayesian framework
- random fields
- image analysis
- low level
- feature points
- template matching
- image pixels
- parametric models
- image regions
- image statistics
- image segmentation
- multi modal
- image classification
- information retrieval
- high resolution
- image retrieval
- segmentation algorithm
- statistical model
- keywords
- test images
- region of interest
- probabilistic model
- visual effects
- bounding box
- object recognition
- markov random field
- hough transform
- medical images
- image representation