Login / Signup
Semantically consistent Video-to-Audio Generation using Multimodal Language Large Model.
Gehui Chen
Guan'an Wang
Xiaowen Huang
Jitao Sang
Published in:
CoRR (2024)
Keyphrases
</>
multimedia
probabilistic model
mathematical model
objective function
video sequences
computational model
real time
neural network
natural language
probability distribution
object oriented
programming language
audio visual