Learning Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval.
Niluthpol Chowdhury MithunJuncheng LiFlorian MetzeAmit K. Roy-ChowdhuryPublished in: ICMR (2018)
Keyphrases
- text retrieval
- cross modal
- multimedia retrieval
- multimedia
- visual recognition
- learning process
- learning algorithm
- multi modal
- document retrieval
- multimedia information retrieval
- document collections
- video data
- image retrieval
- query processing
- probabilistic model
- similarity search
- space time
- retrieval systems
- video frames
- object recognition
- video sequences
- machine learning