Sign in
Learning Audio-Video Modalities from Image Captions.
Arsha Nagrani
Paul Hongsuck Seo
Bryan Seybold
Anja Hauth
Santiago Manen
Chen Sun
Cordelia Schmid
Published in:
CoRR (2022)
Keyphrases
</>
image data
audio video
online learning
image classification
multiscale
image retrieval
learning process
mobile learning