Video-Music Retrieval with Fine-Grained Cross-Modal Alignment.
Yuki EraRen TogoKeisuke MaedaTakahiro OgawaMiki HaseyamaPublished in: ICIP (2023)
Keyphrases
- fine grained
- cross modal
- music retrieval
- multi modal
- visual data
- music information retrieval
- video sequences
- video data
- access control
- image retrieval
- audio features
- multimedia
- semantic concepts
- video frames
- video content
- multimedia databases
- video analysis
- key frames
- audio signal
- document retrieval
- visual information
- contextual information
- semantic features