Improving Interpretable Embeddings for Ad-hoc Video Search with Generative Captions and Multi-word Concept Bank.
Jiaxin Wu, Chong-Wah Ngo, Wing-Kwong Chan
Published in: CoRR (2024)
Keyphrases
- video search
- multiword
- image search
- visual content
- video content
- multimodal
- visual features
- video retrieval
- semantic concepts
- semantic content
- context sensitive
- multimedia
- video shots
- image collections
- visual information
- image annotation
- low dimensional
- image content
- high dimensional data
- video data
- visual concepts
- image retrieval
- feature extraction