HiT: Hierarchical Transformer with Momentum Contrast for Video-Text Retrieval.
Song LiuHaoqi FanShengsheng QianYiru ChenWenkui DingZhongyuan WangPublished in: CoRR (2021)
Keyphrases
- text retrieval
- video sequences
- information retrieval
- video data
- multimedia
- image retrieval
- cross language
- document retrieval
- document collections
- keyword extraction
- multimedia information retrieval
- multimedia retrieval
- retrieval model
- retrieval systems
- inverted file
- latent semantic indexing
- query expansion
- video content
- video retrieval
- vector space
- space time
- retrieval quality
- automatic query expansion
- video shots
- retrieval effectiveness
- video frames
- feature vectors
- blind relevance feedback