Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning.
Saurabhchand Bhati
Jesús Villalba
Laureano Moro-Velázquez
Thomas Thebaud
Najim Dehak
Published in:
CoRR (2023)
Keyphrases
visual learning
input image
multiscale
image features
image segmentation
text graphics
image retrieval
feature points
image representation
probabilistic model
image classification
denoising
test images
similarity measure
high resolution
signal processing