Conditional Video Diffusion Network for Fine-Grained Temporal Sentence Grounding.

Published in: IEEE Transactions on Multimedia (2024)

Keyphrases