Learning text-to-video retrieval from image captioning.
Lucas VenturaCordelia SchmidGül VarolPublished in: CoRR (2024)
Keyphrases
- video retrieval
- content based image
- video search
- image data
- image features
- semantic gap
- image retrieval
- multiscale
- image representation
- key frames
- active learning
- information retrieval
- image content
- semantic information
- semantic video retrieval
- video collections
- content based retrieval
- similarity measure
- image processing