Learning Video-Text Aligned Representations for Video Captioning.

Published in: ACM Trans. Multim. Comput. Commun. Appl. (2023)

Keyphrases