Start from Video-Music Retrieval: An Inter-Intra Modal Loss for Cross Modal Retrieval.
Zeyu ChenPengfei ZhangKai YeWei DongXin FengYana ZhangPublished in: CoRR (2024)
Keyphrases
- cross modal
- music retrieval
- multi modal
- visual data
- multimedia retrieval
- audio features
- music information retrieval
- multimedia
- multimedia databases
- video data
- image retrieval
- video sequences
- multimedia data
- visual similarity
- audio visual
- video search
- semantic concepts
- video content
- audio signal
- video frames
- semantic features
- low level
- image sequences
- key frames
- visual information
- text categorization
- visual features
- object recognition