Login / Signup

Semantically consistent Video-to-Audio Generation using Multimodal Language Large Model.

Gehui ChenGuan'an WangXiaowen HuangJitao Sang
Published in: CoRR (2024)
Keyphrases