Towards Generalisable Video Moment Retrieval: Visual-Dynamic Injection to Image-Text Pre-Training.

Published in: CoRR (2023)
