Joint embeddings with multimodal cues for video-text retrieval.

Niluthpol Chowdhury Mithun Juncheng Li Florian Metze Amit K. Roy-Chowdhury

Published in: Int. J. Multim. Inf. Retr. (2019)