Sign in

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning.

Antoine YangArsha NagraniPaul Hongsuck SeoAntoine MiechJordi Pont-TusetIvan LaptevJosef SivicCordelia Schmid
Published in: CVPR (2023)
Keyphrases