XLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training.
Biao Wu
Yutong Xie
Zeyu Zhang
Minh Hieu Phan
Qi Chen
Ling Chen
Qi Wu
Published in: CoRR (2024)
Keyphrases
cross modal
image data
image retrieval
image features
multi modal
image content
multiscale
visual similarity
image collections
visual data
image segmentation
image classification
spatial information
test images
image representation
image understanding
video retrieval
multimedia retrieval
low level