MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval.

Published in: CoRR (2022)

Keyphrases