VIMI: Grounding Video Generation through Multi-modal Instruction.

Yuwei Fang, Willi Menapace, Aliaksandr Siarohin, Tsai-Shien Chen, Kuan-Chien Wang, Ivan Skorokhodov, Graham Neubig, Sergey Tulyakov
Published in: CoRR (2024)
Keyphrases