A Hierarchical Multi-Modal Encoder for Moment Localization in Video Corpus.
Bowen ZhangHexiang HuJoonseok LeeMing ZhaoSheide ChammasVihan JainEugene IeFei ShaPublished in: CoRR (2020)
Keyphrases
- multi modal
- video search
- semantic concepts
- video sequences
- video data
- multiple modalities
- multi modality
- video streams
- audio visual
- video content
- image annotation
- video encoder
- multimedia
- video analysis
- video database
- video frames
- video clips
- key frames
- spatial and temporal
- cross modal
- uni modal
- visual data
- video retrieval
- bit rate
- motion estimation