VideoXum: Cross-Modal Visual and Textural Summarization of Videos.
Jingyang LinHang HuaMing ChenYikang LiJenhao HsiaoChiuman HoJiebo LuoPublished in: IEEE Trans. Multim. (2024)
Keyphrases
- cross modal
- multi modal
- visual data
- video search
- visual similarity
- multimedia retrieval
- visual recognition
- image retrieval
- perceptual information
- multimedia databases
- video data
- visual features
- visual information
- video content
- web images
- computer vision
- video frames
- video sequences
- information retrieval
- semantic concepts
- video analysis
- image search
- multimedia data
- keywords